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The frequency of heterozygotes for the seven sequence variants ranged from 3-1 1% 
among the 36 individuals tested. Some of the sequence variances appear to occur more 
commonly in certain racial or ethnic groups. For example, heterozygotes for four 
sequence variances (at nucleotides 1059, 1428, 3324 and 3375) were detected solely 
5 or predominantly in North American Blacks, with heterozygote frequencies of 1/4 or 

2/4. The nucleotide 2538 variance was detected solely in North American Whites 
(4/16) and results in an amino acid exchange (see below). The nucleotide 3397 
sequence variance was detected solely in one Japanese individual (of four tested). 
The nucleotide 2538 sequence variant results in an aspartic acid vs. glutamic acid 
10 substitution at amino acid 740 of the 1024 amino acid protein. This residue lies in the 

cytoplasmic loop of the 1 subunit. 

The alpha] subunit of Sodium Potassium ATPase maps to chromosome IpU-pl 1 

15 Tte g en e for the 1 subunit of sodium-potassium ATPase has been mapped to 

chromosome band lpl3-pl 1 by several techniques. Yang-Feng et al. (10) assigned the 
ATP1 Al gene to lp21-cen by Southern analysis of DNA from panels of rodent/human 
somatic cell hybrid lines. This localization was confirmed and refined by Chehab et 
al., who showed that the gene for the ATP1A1 subunit is on lpl3-pll using 

20 hybridization to flow-sorted chromosomes and in situ hybridization (9). 

Chromosome band lp!3~pll is a site of frequent loss of heterozygosity 

The short arm of chromosome 1 is comparatively well investigated for allele loss, 
25 especially in breast and colon cancers, however most of these studies are principally 

concerned with the lp36 region, and there is comparatively little data on lpl3-pl 1. 
The best studies of proximal Ip allele loss are in breast and testicular cancers. These 
studies show LOH occurs in approximately 15-35% of breast cancers (1 1,12) and 15- 
25% of testicular cancers (13). Data from more distal loci on Ip show >25% LOH in 
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glioma, colon cancer, stomach cancer, ovarian cancer, and liver cancer (14). The LOH 
observed in this region indicates that other essential genes mapping to the lp 
chromosomal arm, and especially to the lpl 1 region, which have LOH and for which 
sequence variances, and therefore heterozygotes for a sequence variance, exist in 
normal somatic cells of individuals in a population are potential target genes 
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Example 12: Ribonucleotide Reductase, Ml subunit (RRM1) - Target Gene 
VARIA200 

Ribonucleotide Reductase is essential for cell growth 

Human ribonucleotide reductase (also called ribonucleoside diphosphate reductase) is 
essential in dividing cells for the production of deoxyribonucleotides prior to DNA 
synthesis in S phase. Ribonucleotide reductase catalyzes the reduction of all four 
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ribonucleoside diphosphates to the corresponding deoxyribonucleoside diphosphates 
by replacing the 2' hydroxyl moiety of ribose with a hydride ion to form deoxyribose; 
these reactions constitute the first committed steps in the creation of DNA precursors 
(deoxyribonucleotides), and are therefore tightly regulated by allosteric nucleotide 
binding sites on the Ml subunit (2,3). The enzyme is an 2 2 tetramer apparently 
conserved in all prokaryotes and eukaryotes (1). The two subunits, Ml and M2, are 
both required for enzyme activity. The RRM2 subunit contains the catalytic site, while 
the RRM1 subunit provides an indispensable allosteric function. (See pages 758-763 
of Biochemistry by C.K. Mathews and K.E. van Holde, Benjamin/Cummings 
Publishing Biochemistry, Company, Redwood City, 1990 for a fuller account of 
ribonucleotide reductase function.) 



Both ribonucleotide reductase subunits are expressed in all proliferating cells but are 
generally nondetectable in quiescent cells. Ribonucleotide reductase subunit M2 is the 
target of several antineoplastic compounds, including hydroxyurea. Hydroxyurea is 
used in the chemotherapy of a variety of myeloproliferative disorders (4). It acts by 
reversibly destroying a tyrosyl free radical in the catalytic site of the M2 subunit (3). 
Hydroxyurea and other ribonucleotide reductase poisons are specific for the S phase 
of the cell cycle, resulting in growth arrest at the Gl-S boundary and apoptotic death 
in tumor cells (5). Exposure of cell cultures to hydroxyurea results in selection of cells 
expressing high levels ribonucleotide reductase, demonstrating that ribonucleotide 
reductase is required for these cells to grow (6). 

The human ribonucleotide reductase gene has sequence variances 

The cDNA sequence of the human ribonucleotide reductase Ml subunit has been 
published by two groups (7,8). We undertook a systematic search for DNA sequence 
variance in the cDNA of the Ml subunit by analysing 36 unrelated individuals using 
the single strand conformation polymorphism technique. Primers were designed using 
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the sequence of Parker et al. (GENBANK accession X59543; see ref. 7). SSCP 
analysis revealed 4 sequence variances, and subsequent DNA sequence analysis 
confirmed that nucleotides 1037 (C vs. A), 2410 (A vs. G), 2419 (A vs. G) and 2717 
(T vs. A) vary as shown in the Target Summary Table. (The sequence variance at nt 
5 1 037 was previously noted by Parker et al., ref. 7.) Also, DNA sequencing revealed an 

insertion/deletion sequence variance: the 9 consecutive T nucleotides between 
positions 2724 and 2732 (numbering from ref. 7) were augmented in some cDNAs by 
a tenth T. (This sequence variance is designated T9 vs. T10 in the Target Summary 
Table.) 

10 

Both alleles at nt 1037 were detected in North American Whites, Hispanics, Chinese, 
Japanese, Arabs and Indians. Similarly, both alleles of the sequence variance at nt 
2410 were detected in virtually all tested populations: North American White, North 
American Black, Hispanic, Chinese, Arab and Indian. In contrast, the sequence 
15 variances at nt 2419 and 2717 were prevalent in North American Blacks, Hispanics, 

Chinese, and Japanese, but not North American Whites. The insertion/deletion 
sequence variance at nt 2724 was only studied in four individuals so no firm 
conclusions can be drawn regarding population distribution, however it appears to be 
in linkage disequilibrium with the 2419 and 2724 sequence variances. 

20 

The human ribonucleotide reductase gene maps to chromosome llpl5.5 

The gene for human ribonucleotide reductase has been mapped to band 1 lpl5.5 by 
several techniques. Initially the gene was localized by Southern hybridization analysis 
25 of human X rodent somatic cell hybrids and by chromosomal in situ hybridization (9). 

Subsequently RRM1 has been placed on a yeast artificial chromosome (YAC) physical 
map of chromosome 1 lpl5 (10). The precise physical localization of the RRM1 gene 
facilitates interpretation of LOH results at adjacent polymorphic markers (see below). 
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Chromosome band llpl5.5 is a site of frequent loss of heterozygosity 

The short arm of chromosome 11 is the site of several tumor suppressor genes, 
including the WT1 gene and the Beckwith-Weidemann syndrome gene. As a result 
5 there are many studies of LOH in 1 lpl5.5, particularly focusing on breast, cervix, 

kidney, liver, lung, ovarian, stomach and testicular cancers. These studies show that 
the 1 lpl5.5 band of chromosome 1 1 is frequently reduced to one copy (1 1-28). For 
example, LOH occurs in approximately 13-33% of breast cancers (1 1-13), 14-42% of 
cervical cancers (14), 0-50% of liver cancers (16), 0-80% of lung cancers (17-19), 18- 
10 54% of ovarian cancers (20,21), 0-71% of stomach cancers (22) and 0-50% of 

testicular cancers (23,24). Other studies show that 1 lpl5.5 LOH may also be frequent 
in bladder cancer (25), esophageal cancer (26), some leukemias (27) and sarcomas 
(28). Many deletions in the llpl5.5 region span relatively short chromosomal 
segments (2-10 megabases; see ref. 17). 

15 

References 



1. Caras, LW., Levinson, B.B., Fabry, M., et al. (1985) Cloned mouse ribonucleotide 
reductase subunit Ml cDNA reveals amino acid sequence homology with Escherichia 

20 Coli and herpesvirus ribonucleotide reductases. J. Biol. Chem. 260: 7015-7022. 

2. Thelander, L., and P. Reichard, (1979) Reduction of Ribonucleotides. Annu. Rev. 
Biochem. 48:133-158. 

3. Reichard, P. and A Ehrenberg (1983) Ribonucleotide reductase: a radical enzyme. 
Science 221: 514-9. 

25 4. Donehower, R.C. (1992) An Overview of the clinical experience with 

hydroxyurea. Seminars in Oncology 19:1 1-19, 1992. 

5. Wright, P.S., Cross-Doersen, D., ThOng, J.P., et al. (1996) A ribonucleotide 
reductase inhibitor, MDL 101,731, induces apoptosis and elevates TRPM-2 mRNA 
levels in human prostate tumor xenografts. Experimental Cell Research 22: 54-60. 



WO 98/41648 



PCT/US98/05419 



205 232/116 

6. Cocking, J.M., Tonin, P.N., Stokoe, et al. (1987) Gene for Ml subunit of 
ribonucleotide reductase is amplified in hydroxyurea-resistant hamster cells. Somat. 
Cell Mol Genet 13:221-33. 

7. Parker, N.J., Begley, C.G. and R.M. Fox. (1991) The Human Ml Subunit of 
5 Ribonucleotide Reductase: cDNA Sequence and Expression in Stimulated 

Lymphocytes. Nucleic Acids research 9:374 1 . 

8. Pavloff, N., Rivard, D., Masson, S, Shen, S.H. and A.M. Mes-Masson. (1992) 
Sequence Analysis of the Large and Small Subunits of Human Ribonucleotide 
Reductase. DNA Sequence 2:227. 

10 9. Brissenden, J.E., Caras, I., Icelander, L. and Francke, U. (1988) The structural gene 

for the Ml subunit of ribonucleotide reductase maps to chromosome 1 1 , band pi 5, in 
human and to chromosome 7 in mouse. Exp. Cell Res. 174:302-8. 

10. See: http://shows.med.bufTalo.edu/home.html 

11. Ali, I., Lidereau, R., Theilley, C. and R. Callahan (1987) Reduction to 
15 homozygosity of genes on chromosome 11 in human breast neoplasia. Science 

238:185-8. 

12. Winqvist, R., Mannermaa, A., Alavaikko, et al. (1993) Refinement of regional loss 
of heterozygosity for chromosome 1 lp 1 5.5 in human breast tumors. Cancer Research 
53: 4486-4488. 

20 13. Carter, S.L., Negrini, M., Baffa, R., et al. (1994) Loss of heterozygosity at 1 lq22- 

q23 in breast cancer. Cancer Research 54:6270-4. 

14. Mitra, A.B., Murty, V.V.V.S., Li, R.G., et al. (1994) Allelotype analysis of 
cervical carcinoma. Cancer Research 54:448 1 . 

15. Fujimori, M, Tokino, T., Hino, O., et al. (1991) Allelotype study of primary 
25 heptocellular carcinoma. Cancer Research 5 1 : 89-93. 

16. Wang, H.P. & C.E. Rogler (1988) Deletions in human chromosomes 1 lp and 13q 
in primary hepatocellular carcinomas. Cytogenetics and Cell Genetics 48:72-78. 

17. Bepler, G. and Garcia-Blanco, M.A. (1994) Three Tumor Suppressor Regions on 
Chromosome 1 lp Identified by High Resolution Deletion Mapping in Human Non- 



WO 98/41648 



PCT/US98/05419 



206 232/116 
Small Cell Lung Cancer. Proc. Natl Acad. Sci. U.S.A. 91:5513-7. 

18. Iizuka, M, Sugiyama, Y.,Shiraishi, M., et al. (1995) Allelic losses in human 
chromosome 1 1 in lung cancers. Genes, Chromosomes & Cancer 13:40-46. 

19. Weston, A., Willey, J.C., Modali, R, et al. (1989) Differential DNA sequence 
5 deletions from chromosomes 3, 1 1, 13 and 17 in squamous cell carcinoma , large-cell 

carcinoma and adenocarcinoma of the human lung. Proc. Natl. Acad Sci. U.S.A. 
86:5099-5103. 

20. Kiechle-Schwartz, M, Bauknecht, T., Wienker, T., et al. (1993) Loss of 
Constitutional Heterozygosity on Chromosome lip in Human Ovarian Cancer. 

10 Cancer 72:2423-32. 

21. Viel, A., Giannini, F., Tumiotti, L., Sopracordevole, F., Visentin, M.C. and M. 
Boiocchi (1992) Chromosomal localization of two putative 1 Ip oncosuppressor genes 
involved in human ovarian tumors British Journal of Cancer 66: 1030-1036. 

22. Baffa, R., Negrini, M., Mandes, B., et al. (1996) Loss of heterozygosity for 
15 chromosome 1 1 in adenocarcinoma of the stomach. Cancer Research 56: 268-72. 

23. Lothe, R.A., Hastie, N., Heimdal, K., et al. (1993) Frequent loss of lpl3 and 
1 lpl5 loci in male germ cell tumors. Genes, Chromosomes & Cancer 7:96-101. 

24 Smith, R.C., and Rukstalis, D.B. (1995) Frequent Loss of Heterozygosity at 1 lp 
Loci in Testicular Cancer. The Journal of Urology 153:1 684-7. 
20 25. Shaw, M.E. and Knowles, MA. (1995) Deletion Mapping of Chromosome 1 1 in 

Carcinoma of the Bladder. Genes, Chromosomes & Cancer 13:1-8. 

26. Shibagaki, L, Shimada, Y., Wagata, T., et al. (1994) Allelotype analysis of 
esophageal squamous cell carcinoma. Cancer Research 54: 2996-3000. 

27. Ahuja, H.G., Foti, A., Zhou, D.J. and M.J. Cline (1990) Analysis of proto- 
25 oncogenes in acute myeloid leukemia: loss of heterozygosity for the Ha-ras gene. 

B/oorf 75: 819-822. 

28. Yamaguchi, T., Toguchida, J., Yamamuro, T., et al. (1992) Allelotype analysis in 
osteosarcoma: frequent allele loss on 3q, 13q, 17p and 18q. Cancer Res. 52: 2419. 



WO 98/41648 PC77US98/05419 



207 232/116 
Example 13: Thymidylate Synthase (TS) - Target Gene VARIA250 

Thymidylate Synthase is essential for cell growth 

5 Human thymidylate synthase (TS) catalyzes the formation of thymidine 

monophosphate (dTMP) from deoxyuridine monophosphate (dUMP) by transfer of a 
methyl group from N5,N10-methylenetetrahydrofolate to carbon 5 of dUMP. This is 
the sole de novo pathway to dTMP, an essential precursor for DNA synthesis. TS also 
plays an important role in balancing the four nucleotide precursors for DNA polymer 
10 synthesis (1). Thus TS is an attractive target for antiproliferative drugs. (See 

BiQQhemigtry by C.K. Mathews and K.E. van Holde, Benjamin/Cummings Publishing 
Company, Redwood City, 1990, pages 763-768, for a fuller account of thymidylate 
synthase function.) 

Like some other growth associated genes involved in DNA synthesis, thymidylate 
synthase is expressed in proliferating cells at 20-40 fold higher levels than in quiescent 
cells. Increased expression occurs at the Gl-S transition of the cell cycle when 
quiescent cells are stimulated with serum. Levels of thymidylate synthase are finely 
controlled by autoregulatory feedback loops wherein TS protein regulates the 
transcription, stability and translational efficiency of TS mRNA (2). Transcription 
increases by only 2-4 fold, so posttranscriptional events constitute the predominant 
regulatory mechanisms (3). One mechanism of 5-FU resistance is increased expression 
ofTS Mrna. 

25 Thymidylate synthase is the target of 5-fluorouracil (5-FU), a potent antineoplastic 

compound. Once inside cells 5-FU is ribosylated and phosphorylated to 5-fluoro-2'- 
deoxyuridine S'-monophosphate (F-dUMP), which acts as an inhibitory transition state 
analog of TS when bound in the presence of the enzyme's second substrate, N5,N10- 
methylenetetrahydrofolate. (5-FU is also incorporated into both DNA and RNA, 
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augmenting its toxicity.) 5-FU induces partial responses in 10-30% of patients with 
a variety of cancers, including metastatic breast and gastrointestinal tract cancers (4). 
While 5-FU is a potent antiproliferative agent in tissue culture cells, as with most 
antineoplastic drugs, its clinical utility is limited by lack of discrimination between 
5 normal cells and tumor cells: common toxic effects include stomatitis, diarrhea, bone 

marrow suppression, hair loss and occasionally cardiac and neurologic symptoms. 

The human thymidylate synthase gene has sequence variances 

The sequence of a human thymidylate synthase cDNA was determined by Takeishi et 
al. (5), who later determined the genomic sequence as well (6). We undertook a 
systematic search for DNA sequence variance by analysing 36 unrelated individuals 
using the single strand conformation polymorphism. Primers were designed using the 
sequence of Takeishi et al. (5). SSCP analysis revealed 3 DNA fragments having 
sequence variances, and subsequent DNA sequence analysis showed that nucleotides 
1066 (C vs. T), 1 136 (A vs. G) and 1497 (A vs. T) vary among normal individuals as 
shown in the Target Summary Table. All three sequence variances are in the 3' 
untranslated region of the gene. The nucleotide 1 066 and 1497 sequence variances are 
in complete linkage disequilibrium in the 36 individuals examined. Both alleles of all 
three sequence variances were detected in North American Whites, North American 
Blacks, Chinese, Japanese, Arabs and Indians. 

Another TS sequence variance has been described by Berger and colleagues (7-9). 
They detected a T to C change at nucleotide 276 of the TS gene, resulting in the 
25 substitution of histidine for an evolutionarily conserved tyrosine at residue 33 of TS 

protein. So far the histidine allele has been detected in only one cell line, HCT1 16 (7). 
The rare his-33 form of the protein is 3-4 fold more resistant to FdUrd than the tyr-33 
form, due to an 8 fold lower catalytic efficiency (kcat), suggesting that histidine at 
residue 33 perturbs the structure of the TS active site (9) 
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The human thymidylate synthase gene maps to chromosome 18pl 1.32 

The gene for human thymidylate synthase was initially mapped to the long arm of 
chromosome 18 (18q21.31-qter) by somatic cell hybrid analysis (10), however two 
5 subsequent reports place the gene in band 18p 11.32 using fluorescence in situ 

hybridization (11,12). 

Chromosome band 18plL32 is a site of loss of heterozygosity 

10 The long arm of chromosome 1 8 contains the DCC (deleted in colon cancer) candidate 

tumor suppressor gene and has been well studied in a variety of tumors. The short arm 
(18p), where TS apparently resides, has not been studied as extensively. The available 
data suggests there is LOH in approximately 45% of colon cancers (13) and 25-30% 
of cervical (14), head and neck (15), lung (16) and ovarian (17) cancers and sarcomas. 

15 L0H has ^so been described in breast, brain, esophagus, kidney and prostate cancers 

(0-15%). 18p has not been studied for allele loss in several other major cancers, 
including bladder, leukemia, lymphoma, liver, pancreas, stomach and testicular 
cancers. 
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Example 14: Cytidine Triphosphate Synthetase (CTPS) - Target Gene VARIA260 

Cytidine Triphosphate Synthetase is essential for cell growth 

Human cytidine triphosphate synthetase catalyzes the glutamination of UTP to form 
CTP. The reaction is: UTP + ATP + glutamine --> CTP + ADP + ?i + glutamate. This 
is the rate limiting step in the synthesis of cytidine nucleotides from both the de novo 
and uridine salvage synthesis routes (see ref. 1 and references therein). CTPS also 
plays a vital regulatory function in balancing nucleotide pools for DNA polymer 
synthesis; it is allosterically regulated by CTP (negatively) and GTP (positively). 

There is compelling evidence that CTPS is essential for cell survival: 

CTPS is evolutionarily conserved in yeast and bacteria, with a high degree of 
amino acid identity in regions mediating allosteric regulation and catalysis (1- 
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3). (Another example: the human and hamster enzymes are identical in leneth 
and 98% amino acid identical over 591 amino acids.) 
Mutant hamster cells lacking functional CTPS need exogenous cytidine to 
survive (3). 

There is no known human deficiency disease of CTPS. 
CTPS function is increased in proliferating cells (4). 

Thus CTPS is an attractive target for antiproliferative drugs. Cyclopentyl cytosine 
(CPE-C) is a synthetic cytidine analog in which a cyclopentyl group replaces the furan 
ring of the ribose sugar. CPE-C has antineoplastic and antiviral effects in animal 
models (5). The drug is kinased intracellularly to the triphosphorylated nucleotide form 
(CPE-CTP). Exposure of cells to CPE-C leads to rapid depletion of CTP pools, as a 
result of inhibition of CTPS by CPE-CTP (6,7). Upregulation of CTP synthetase, or 
loss of negative allosteric modulation by CTP is associated with resistance to the 
15 cancer chemotherapy drugs arabinosyl cytosine (ara-C), 5-fluorouracil and other 

cytotoxic nucleoside analogs as well as alkylating agents (3). 

The human cytidine triphosphate synthetase gene has sequence variances 

20 The sequence of a human cytidine triphosphate synthetase cDNA was determined by 

Yamauchi et al. (1), who later determined the genomic sequence as well (2). We 
undertook a systematic search for DNA sequence variance by analysing 36 unrelated 
individuals using the single strand conformation polymorphism technique. Primers 
were designed using the sequence of Yamauchi et al. (1). SSCP analysis revealed 3 

25 DNA fragments having sequence variances, and subsequent DNA sequence analysis 

showed that nucleotides 576 (A vs. G), 2093 (C vs.7) and 2135 (G vs. A) vary among 
normal individuals as shown in the Target Summary Table. The nucleotide 576 
sequence variance is a silent substitution in the coding region, while the latter two 
sequence variances are in the 3' untranslated region of the cDNA. All three sequence 
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variances were detected at low frequency in the panel of 36 individuals (3-8%), 
however all but one of the heterozygotes is Asian, and it seems likely that a larger 
survey of Asian populations would show higher allele frequencies in Chinese and other 
groups. For example among the four Chinese in the panel two (50%) are heterozygous 
for the residue 2135 sequence variance, and one (25%) is heterozygous for the nt 576 
sequence variance. Also, the one Cambodian in the panel is heterozygous for both the 
2093 and 2135 sequence variances. 



The human cytidine triphosphate synthetase gene maps to chromosome lp34. 1 

10 

The gene for human cytidine triphosphate synthetase has been mapped to lp34.1 by 
somatic cell hybrid analysis (2). 

Chromosome band lp34. 1 is a site of frequent loss of heterozygosity 

15 

The short arm of chromosome 1 is comparatively well investigated for allele loss, 
especially in breast and colon cancers. The lp35-32 and lp22-13 regions flank lp34.1 
and are the best available markers for LOH on Ip. Studies of these regions show 30* 
50% LOH frequency in breast cancer (8-12), 41-75% in glioma (a brain cancer 
20 subtype) (13), 20-40% in colon cancer (14,15), -50% in stomach cancer (16), -20% 

in lung cancer (17) and 20-30% in ovarian cancer (1 8). High frequency LOH has been 
detected in several uncommon cancers such as pheochromocytoma (50-86%) and 
neuroblastoma (-50%). Most other common cancers have not been adequately 
investigated to assess LOH frequency in this region. 

25 
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25 Example 15: Cysteinyl tRNA Synthetase (CARS) - Target Gene VARIA301 

The human cysteinyl tRNA synthetase gene is essential for cell survival 



Cysteinyl-tRNA synthetase (CARS) catalyzes ATP dependent covalent attachment of 
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cysteine to its cognate tRNA to form cysteinyl-tRNA. In the absence of cysteinyl- 
tRNA, protein synthesis is blocked. Since Cysteinyl-tRNA synthetase is a single copy 
gene in man, inhibition of its function is expected to be cell lethal. This has been 
shown for other tRNA synthetases (summarized above). 

The human cysteinyl-tRNA synthetase gene and mRNA have sequences variances 

A human cDNA encoding cysteinyl tRNA synthetase (CARS) was cloned based on the 
similarity of a human expressed sequence tag (EST) to E. coli cysteinyl tRNA 
synthetase (1). The published human CARS cDNA is 2048 nucleotides in length and 
includes a 30 nucleotide 5' untranslated region followed by an open reading frame of 
1914 nucleotides and a 3* untranslated region of 134 nucleotides (1). An EMBL/ 
GENBANK submission (accession # L06845) by the authors of ref 1 includes a 3' 
untranslated region 423 nucleotides longer than the published sequence, but lacks 19 
consecutive A nucleotides after position 2029 (making a net increase of: 423 - 19 = 
404 nucleotides, and a composite cDNA of: 2048 + 404 = 2452 nucleotides in length. 
We have confirmed the existence of 2452 nt transcripts by PCR amplification of 
reverse transcribed mRNA.) We designed primers as shown on the annotated cDNA 
sequence and screened the composite 2452 nt cDNA for sequence variance in 36 
unrelated individuals by the single strand conformation polymorphism (SSCP) 
technique. Two sequence variances were identified. One of the sequence variances, 
located in the 5' untranslated region, was below the desired level of 20% 
heterozygosity. The other sequence variance is a C vs. T transition near the 3' end of 
the coding sequence at nucleotide 1739 (see annotated sequence). 

The human cysteinyl tRNA synthetase protein has sequence variances 
The deduced amino acid sequence of the human CARS gene encodes a protein of 638 
amino acids which probably functions as a monomer, by analogy to related 
synthetases. The deduced protein contains two sequence motifs, HIGH (residues 64- 
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67) and KMSKS (residues 406-410), which define Class I synthetases (see ref. 2 for 
background information on tRNA synthetases). These two conserved motifs form an 
ATP binding fold (the Rossman fold) in the amino terminal half of the protein. 
Cytosine at nucleotide 1739 encodes proline at residue 622 of the protein, while 
5 thymine at nucleotide 1739 encodes leucine. The pro/leu amino acid sequence 

variance is a mere 16 residues from the C terminus of the protein. The C-terminal 
portion of CARS, by analogy to other class I synthetases, contains the tRNA binding 
site. 

1 0 Frequency of CARS heterozygotes 

The frequency of heterozygotes for the nucleotide 1739 sequence variance is --45-50% 
in all major racial groups surveyed (see accompanying table), including North 
American Whites (8/15=53%), North American Blacks (2/4=50%), Chinese 
15 (2/4=50%), Swedish (127/344=37%) and Japanese (1/4=25%). The wide population 

distribution of both alleles suggests that other population groups will also have a high 
frequency of heterozygotes. 

Gene Mapping of CARS to Up] 5.5 

20 

Human CARS cDNA has been mapped to chromosome 1 1 pi 5.5 by screening human 
X Chinese hamster somatic cell hybrids informative for all human chromosomes, and 
by fluorescence in situ hybridization (3). Both mapping techniques were conclusive 
in showing only one locus for human CARS. Detailed physical maps of 1 1 pi 5.5 have 
25 subsequently allowed precise localization of the CARS gene relative to other DNA 

markers (4). 



LOH at 11 pi 5. 5 is well documented in many cancer types 

The short arm of chromosome 11, and particularly the 1 1 pi 5.5 region, is deleted in a 
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variety of human cancers, including (but not limited to) ovarian (1 8 - 50% LOH), non- 
small cell lung (22 - 71%), breast (12 - 33%), bladder (40 -50%), esophageal (18 - 
40%) and testicular cancers (18 - 66%) (refs. 5-12). Many deletions in the 1 lpl5.5 
region span relatively short chromosomal segments (2-10 megabases; see ref. 8). 
Using the specific variances identified in the CARS gene as markers for 
heterozygosity, we have determined that LOH occurs in 10/20 ovarian cancers (50%) 
and 10/52 non-small cell lung cancers (19%). 

Assays for human CARS inhibitors 

There is no published work on the protein encoded by the putative human CARS 
cDNA, nor on any other eukaryotic CARS protein, however the extensive 
characterization of other Class I synthetases from both prokaryotes and eukaryotes 
provides a template for modeling the structure of human CARS. (For an example of 
how this can be done see ref, 14, in which the three dimensional structure of human 
alanyl-tRNA synthetase has been modeled up to amino 249 by neural net software and 
multiple alignments of partial and complete human AARS sequences with 
heterologous prokaryotic class II synthetases for which crystal structures exist.) With 
respect to the C-terminal location of the variant amino acid residue in human CARS, 
it is worth noting that single amino acid substitutions in the C-terminal region of alanyl 
tRNA synthetase can have greater than 100 fold effects on catalytic activity (15). 
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Example 16: Glutamyl-Prolyl tRNA Synthetase (EPRS): - Target Gene 
VARIA300 

The human glutamyl-prolyl tRNA synthetase gene is essential for cell survival 



Glutamyl-prolyl-tRNA synthetase (EPRS) catalyzes ATP dependent covalent 
attachment of glutamine and proline to their cognate tRNAs to form glutamyl-tRNA 
15 and prolyl-tRNA. In the absence of glutamyl-tRNA or prolyl-tRNA, protein synthesis 

is blocked. Since glutamyl-prolyl-tRNA synthetase is a single copy gene in man, 
inhibition of its function is expected to be cell lethal. This has been shown for other 
tRNA synthetases (summarized above). 

20 The human glutamyl-prolyl tRNA synthetase gene, mRNA and protein have sequence 

variances 

A human cDNA encoding glutamyl-prolyl tRNA synthetase (EPRS) was initially 
misidentified as glutaminyl-tRNA synthetase (1) based on misleading sequence 
25 alignments with bacterial and yeast glutaminyl-tRNA synthetase (2). Subsequently, 

biochemical studies of the protein encoded by a D. melanogaster gene -70% identical 
to the human gene demonstrated glutamyl (not glutaminyl) tRNA synthetase activity, 
and also showed that a single gene encodes both glutamyl- and prolyl-tRNA 
synthetases in the fly (3). These observations eventually led to the realization that 
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human EPRS is also a single polypeptide containing two synthetases (2), The 
aminoacyl tRNA synthetases are divided into two classes (see Background on tRNA 
Synthetases, above). Glutamyl-tRNA synthetase belongs to Class I while Prolyl-tRNA 
synthetase belongs to class II. Thus the two halves of EPRS evolved independently 
and likely represent an evolutionary recent fusion. The published human EPRS 
cDNA is 4,586 nt long and includes a 5 f untranslated region of 58 nt followed by an 
open reading frame of 4320 nt and a 3' untranslated sequence of 208 nt (1). The gene 
encodes a polypeptide of 1440 amino acids. The glutamyl-tRNA synthetase activity 
is encoded by an imprecisely defined segment at 5* end of the gene probably spanning 
at least amino acids 105-426, while the prolyl-tRNA synthetase activity is encoded by 
a segment likely including residues 942-1369 at the 3" end of the gene (2). The two 
synthetase moieties are connected by a central domain of unknown function. It has 
been speculated that the central domain may attach the enzyme to the cytoskeleton or 
to other aminoacyl-tRNA synthetases in a multienzyme complex (2, 3). 

The human glutamyl-prolyURNA synthetase gene andmRNA have sequence variances 
We designed primers and screened the 4586 nt cDNA for sequence variance in 36 
unrelated individuals by the single strand conformation polymorphism technique. 
Seven sequence variances were identified, four located in the coding sequence and 
three located in the 3' untranslated region. As shown on the Annotated Glutamyl- 
Prolyl tRNA Synthetase cDNA Sequence and in the Target Summary Page, the 
sequence variance nucleotides are 2520 (C vs. A), 2944 (G vs. A), 2963 (C vs. T), 2969 
(A vs. G), 3247 (A vs. G), 4459 (G vs. A) and 4506 (G vs. A). The sequences flanking 
the alternate allelic forms and their frequencies of occurrence are shown on the Target 
Summary Page. Less than 10% of individuals surveyed are heterozygous for sequence 
variances at 2520, 2944 and 2963. Heterozygotes for the other 4 sequence variances 
occur more frequently and appear to be widely distributed in the surveyed populations 
(see below). 
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The human glutamyl-prolyl tRNA synthetase protein has sequence variances Three 
nucleotide sequence variances, at 2520, 2963 and 2969, alter the amino acid coding 
sequence of EPRS at residues 821 (pro/his), 969 (his/tyr) and 971 (ile/val). The 
residue 821 his and 969 tyr alleles are relatively rare, with fewer than 10% 
5 heterozygotes in the surveyed populations. The more common residue 971 sequence 

variance lies in the PRS domain of the protein, near one of the widely conserved 
defining motifs for class II tRNA synthetases. 



EPRS heterozygotes are frequent in non- Asian populations. While the overall 
10 frequency of residue 971 heterozygotes is 8/36 (24%), the frequency of heterozygotes 

varies among different populations. For example, there are no heterozygotes among 
10 Asians surveyed (Chinese, Japanese, Filipino and Korean), while 8/26 (31%) of 
non-Asians, including North American Whites, Blacks and Hispanics, are 
heterozygotes. 

15 

The EPRS Gene Maps to Iq41-q42 



Human EPRS cDNA has been mapped to chromosome 1 q4 1 -42 by screening human 
X Chinese hamster somatic cell hybrids informative for all human chromosomes, and 
20 by fluorescence in situ hybridization (3). Both mapping techniques were conclusive 

in showing only one locus for human EPRS. 



Loss of heterozygosity at lq4U42 is documented in several cancer types. 17-25% of 
breast cancers have allele loss in the Iq41-q42 region (4, 5), 29-46% of colon cancers 
25 (6, 7) and 17-26% of cervical cancers (8). One report describes 27% LOH in stomach 

cancer (9). One or two studies of brain, esophageal, kidney, liver and ovarian cancers 
also report LOH. No studies of LOH in the Iq41-q42 region have been reported in 
bladder, endocrine, head and neck, lung, or pancreas cancers or in leukemia or 
lymphoma. 
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Antisense considerations The sequence variances at 2963 and 2969 are close enough 
that a 20-mer antisense oligonucleotide could easily span them. Such an 
oligonucleotide should afford greater allele discrimination than is possible with a 
single nucleotide difference. However, the 2963 sequence variance is fairly rare 
5 (<10% heterozygotes) and not in linkage disequilibrium with the 2963 sequence 

variance, so there are more than two haplotypes in the populations tested. 
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Example 17: Alanyl-tRNA Synthetase (AARS) - Target Gene VARIA304 
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The human glutamyl-prolyl tRNA synthetase gene is essential for cell survival 

Alanyl-tRNA synthetase (AARS) catalyzes ATP dependent covalent attachment of 
alanine to its cognate tRNA to form alanyl-tRNA. In the absence of alanyl-tRNA, 
5 protein synthesis is blocked. Since alanyl-tRNA synthetase is a single copy gene in 

man (see below) inhibition of its function is expected to be cell lethal. This has been 
shown for other tRNA synthetases (summarized above). 

The human alanyl-tRNA synthetase gene and mRNA have sequence variances 

10 

A human cDNA encoding alanyl tRNA synthetase (AARS) was cloned by Shiba et al. 
(1) using cross species PCR: AARS sequences from four evolutionarily distant species 
were compared and primers were designed to conserved regions specific to AARS. 
The cloned human cDNA is 3344 nt in length and includes a 1 10 nt 5 1 untranslated 
15 region, an open reading frame of 2904 nt encoding a 968 residue polypeptide, and a 3' 

untranslated region of 330 nt (ref. 1; Genbank accession D32050). 

We designed primers. The 3344 nt cDNA was screened for sequence variance in 36 
unrelated individuals by the single strand conformation polymorphism (SSCP) 
20 technique. One sequence variance was identified, a C vs. T transition at nucleotide 

1013, within the coding sequence. The published nucleotide at position 1013 is T (1). 

The frequency of AARS heterozygotes is 25-50% in all populations surveyed. The 
frequency of heterozygotes for the nucleotide 1013 sequence variance is 57% in the 36 
25 individuals tested. Both alleles are present in all major racial groups surveyed (see 

Target Gene Summary Table), including North American Whites (9/15=60% 
heterozygotes), North American Blacks (3/4=75%), Chinese (2/4=50%), Japanese 
(1/4=25%) and Hispanic (1/2). The wide population distribution of both alleles 
suggests that other population groups will also have a high frequency of heterozygotes. 
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The AARS gene maps to 16q22 

The human AARS cDNA has been mapped to chromosome 16q22 by us and by 
Nichols et al. (ref. 2). We designed primers to the 3* untranslated region of AARS and 
5 used PCR to analyze the National Institute of General Medical Sciences (NIGMS) 

Human/Rodent Somatic Cell Hybrid Mapping Panel #2 (see page 704 of the NIGMS 
1994/1995 Catalog of Cell Lines, available from the Coriell Cell Repository, Camden, 
NJ). The panel consists of 24 hybrid cell lines, each monochromosomal for one human 
chromosome. The AARS PCR product mapped to the hybrid containing human 

10 chromosome 16. Subsequently we screened the Radiation Hybrid Mapping Panel 

created at Stanford University (rhserver@shgc.stanford.edu) and distributed by 
Research Genetics (RH01). The AARS PCR product mapped near D16S496 with a 
lod score>10. D16S496 is a polymorphic DNA marker at 16q22. The AARS PCR 
product mapped near D16S496 with a LOD score >10. DH16S496 is a polymorphic 

15 DNA marker at 16q22. (See, ref. 29 for a full explanation of modification hybrid 

mapping.) Similar results were obtained by Nichols et al., who mapped AARS by 
analysis of the same NIGMS hybrid mapping panel, by PCR mapping in a 
chromosome 16 regional mapping panel and by fluorescence in situ hybridization to 
metaphase chromosomes. All mapping techniques were conclusive in showing only 

20 one locus for human AARS. 



LOH at 1 6q22 is well documented in many cancer types. Loss of heterozygosity 
studies of chromosome 16q have principally focused on breast and liver cancers. In 
six detailed studies of breast cancer in the 16q22 region LOH frequencies of 40-60% 
25 have been reported (refs 3-8). 16q22 LOH has ben reported in 25-90% of liver cancers 

(9-13), with the average around 45%. Less extensive studies of other cancer types 
report 16q22 LOH in 19% of bladder cancers, 20% of colon cancers (14), 19-27% of 
esophageal cancers (15), 25% of small cell lung cancers (16), 16-37% of ovarian 
cancers (17-19) and 22% of uterine cancers (20), and 3 1-50% of prostate cancers (21- 
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22). 
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Example 18: Threonyl-tRNA Synthetase (TARS) - Target Gene VARIA302 

The human threonyl-tRNA synthetase gene is essential for cell survival 

10 Threonyl-tRNA synthetase (TARS) catalyzes ATP dependent covalent attachment of 

threonine to its cognate tRNA to form threonyl-tRNA. In the absence of threonyl- 
tRNA, protein synthesis is blocked. Threonyl-tRNA synthetase is a single copy gene 
in man (see below) and inhibition of TARS is cell lethal. This has been shown using 
the specific TARS inhibitor borrelidin, a threonine analog. Borrelidin resistant CHO 

15 cel1 lines hav e been isolated; the most resistant lines contain -60-100 fold more 

immunologically reactive protein and 10-20 fold higher TARS activity than non- 
selected CHO cells (1-3). 

The human TARS enzyme is a homodimeric member of the class II tRNA synthetases. 
20 Tne human protein is 53% amino acid identical to S. cerevisiae cytoplasmic TARS, 

40% amino acid identical to E. coli TARS and 39% amino acid identical to yeast 
mitochondrial TARS. The degree of evolutionary conservation is 52-64% when 
conservative substitutions are allowed. 

25 The human Threonyl-tRNA synthetase gene and mRNA have sequence variances. A 

human cDNA encoding threonyl tRNA synthetase was cloned by Cruzen and Arfin 
(GENBANK accession M63 180; ref. 2) using anti-TARS antibodies to screen a lgtl 1 
expression library. The cDNA is 2644 nt in length and includes a 138 nt 5' untranslated 
region, an open reading frame of 2136 nt encoding a 712 residue polypeptide, and a 3' 
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untranslated region of 370 nt. 

We designed primers for amplification. The 2644 nt cDNA was screened for sequence 
variance in 36 unrelated individuals by the single strand conformation polymorphism 
(SSCP) technique. Three sequence variances were identified: G vs. A transitions at 
nucleotides 1608 and 1755 within the coding sequence, and a C vs. T transition at 
nucleotide 2395 of the 3' untranslated region. None of the sequence variances alters 
the sense of the coding strand. The published sequence shows G, G and T at the three 
sequence variance sites 



The frequency of TARS heterozygotes is 25-45% in all populations surveyed. The 
nucleotide 1608 sequence variance was genotyped only in North American Whites, 
45% of whom were heterozygotes. The nucleotide 1608 and 1755 sequence variances 
were both genotyped in 36 individuals, with overall heterozygosity rates of 31% and 
1 5 25 ° /o > respectively. Both sequence variances were detected in North American Whites, 

North American Blacks, Hispanics and Chinese. Of 14 North American Whites 
genotyped at all 3 sequence variance nucleotides, 1 1 (79%) were heterozygous for a 
least one polymor-phism (see threonyl tRNA synthetase summary table). 

20 The TARS gene maps to 5pl3-CEN. The human TARS cDNA has been mapped to 

chromosome 5pl3-CEN by analysis of TARS isoelectric focusing patterns in 
human/Chinese hamster hybrids (). The mapping studies were consistent with one 
human TARS locus. 



LOH at 5pl 3-CEN is documented in several cancer types. The best data on 5p LOH 
is in cervical cancer where 9 markers have been tested in 3 different studies. The 
frequency of LOH ranges from 12-57%, averaging -45%. Other cancers that have been 
studied are breast (10-24% LOH), head and neck (20% LOH), adenocarcinoma of the 
lung (40% LOH, but only 5 cancers were studied), melanoma (40%) and ovary (15- 
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21%). 

Assays for human TARS inhibitors. Human TARS protein is a homodimeric class II 
synthetase. Antibodies to rat TARS were used to clone the human protein. The high 
5 degree of amino acid conservation throughout the protein suggests that it may be 

possible to create yeast and/or bacterial strains with human CARS. 
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15 Example 19: Glutaminyl-tRNA Synthetase (QARS) - Target Gene VARIA305 

The human glutaminyl-tRNA synthetase gene is essential for cell survival 

Glutaminyl-tRNA synthetase (QARS) catalyzes ATP dependent covalent attachment 
20 of glutamine to its cognate tRNA to form glutaminyl-tRNA. In the absence of 

glutaminyl-tRNA, protein synthesis is blocked in eucaryotic cells. Glutaminyl-tRNA 
synthetase is a single copy gene in man . Inhibition of its function is expected to be 
cell lethal, as shown for other tRNA synthetases (summarized above). 

25 The human Glutaminyl-tRNA synthetase gene and mRNA have sequence variances. 



A human cDNA encoding glutaminyl tRNA synthetase (QARS) was cloned by 
Lamour et al. (1) who expressed the cDNA in E. coli and demonstrated glutaminyl 
tRNA synthetase activity in bacterial extracts. The cloned human cDNA 
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(Genbank/EMBL accession number X76013) is 2437 nt in length and includes a 5' 
untranslated region of 5 nucleotides, an open reading frame of 2325 nucleotides 
encoding a 775 amino acid polypeptide, and a 3' untranslated region of 107 nt 
including 8 terminal nt of poly A. 

5 

We designed primers for amplification. The QARS cDNA was screened for sequence 
variance in 36 unrelated individuals using the single strand conformation 
polymorphism (SSCP) technique. One sequence variance was identified, a C vs. T 
transition at nucleotide 404, within the coding sequence. The published nucleotide at 
10 position 404 is C. The sequence variance does not affect the protein encoded. 

The frequency of heterozygotes for the nucleotide 404 sequence variance is 
1 1% in the 36 individuals tested (4/36). However three of 16 North American Whites 
are heterozygotes (19%), and one of four Japanese (25%) (see Target Gene Summary 
15 Table). 

The QARS gene maps to 3p 

The human QARS cDNA has been mapped to chromosome 3 by hybridization of a 
20 QARS probe to a panel of 25 human/rodent somatic cell hybrids (1). One somatic cell 

hybrid, not known to contain human chromosome 3, was positive for both the QARS 
probe and an ACY1 probe. ACY1 maps to human 3p21, suggesting QARS may also 
map in this area. We independently mapped QARS to chromosome 3 using primers 
from the 3' untranslated region to analyze the National Institute of General Medical 
25 Sciences (NIGMS) Human/Rodent Somatic Cell Hybrid Mapping Panel #2 by PCR 

(see page 704 of the NIGMS 1994/1995 Catalog of Cell Lines, available from the 
Coriell Cell Repository, Camden, NJ). The panel consists of 24 hybrid cell lines, each 
monochromosomal for one human chromosome. The QARS PCR product mapped to 
the hybrid containing human chromosome 3. All mapping techniques were conclusive 
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in showing only one locus for human QARS. 

Chromosome band 3p21 is a site of frequent loss of heterozygosity. The short arm of 
chromosome 3 has been well studied in breast, cervical, esophageal, kidney, and lung 
5 cancers. These studies report frequent allele loss at 3p21, varying up to 100% in some 

studies of small cell lung cancer. Among other cancers LOH occurs in approximately 
20-30% of breast cancers (2,3), 30-60% of cervical cancers (4,5), 10-40% of 
esophageal cancers (6,7), 45-80% of kidney cancers (8-10), 50-100% of 
nasopharyngeal cancers (11), 0-75% of squamous cell head and neck cancers (12), 30- 
10 60% of melanomas (13), 30-100% of non-small cell lung cancers (14-16) and 80- 

100% in small cell lung cancer (17-19). Other for which there are reports of LOH in 
at least 20% of cases include leukemia, pancreas cancer, sarcoma, testis cancer and 
ovarian cancer. Other cancer types, including bladder and lymphoma, have not been 
studied for LOH at 3p21. 

15 
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10 

Example 20: Lysyl-tRNA Synthetase (KARS) - Target Gene VARIA303 

Human Lysyl t-RNA synthase gene is essential 

Lysyl-tRNA synthetase (KARS) catalyzes ATP dependent covalent attachment of 
lysine to its cognate tRNA to form lysyl-tRNA. In the absence of lysyl-tRNA, protein 
synthesis is blocked. Since lysyl-tRNA synthetase is a single copy gene in man, 
inhibition of its function is expected to be cell lethal. This has been shown for other 
tRNA synthetases (summarized above). 

The human Lysyl-tRNA synthetase gene and mRNA have sequence variances 

A human cDNA encoding a sequence similar to bacterial lysyl tRNA synthetases was 
cloned by Nomura et al. (GenBank/DDBJ submission D31890; see ref. 1) while 
25 sequencing random cDNAs. No biochemical studies of the protein encoded by this 

sequence have been reported. The 5' end of the sequence apparently begins in the 
coding region and the open reading frame continues for 1805 nucleotides, encoding 
601 residues of a polypeptide (the foil length of which has not been established), 
followed by a 3' untranslated region of 165 nucleotides. 



15 



20 
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We designed primers for amplification. The reported partial cDNA was screened for 
sequence variance in 36 unrelated individuals using the single strand conformation 
polymorphism (SSCP) technique as described in the methods section. Two sequence 
variances were identified, an A vs. G transition at nucleotide 89 and a G vs. C 
transversion at nucleotide 1798, both within the coding sequence. The published 
nucleotides are A and G, respectively. The nucleotide 1 798 sequence variance alters 
the sense of the 599th codon (the third codon from the end of the coding sequence) to 
serine vs. threonine. 

The frequency of KARS heterozygotes varies among the populations surveyed. The 
frequency of heterozygotes for the nucleotide 89 sequence variance is 19% in the 36 
individuals tested. However all heterozygous individuals were either North American 
Whites (4/16; 25% heterozygotes), North American Blacks (1/4; 25%), or Hispanics 
(1/3; 33% heterozygotes). The frequency of heterozygotes for the nucleotide 1798 
sequence variance is 6% in the 36 individuals tested. However all heterozygous 
individuals were North American Blacks (2/4; 50%) (see Target Gene Summary 
Table). Further study of these and other population groups will better establish the 
frequency of heterozygotes for these two sequence variances. 

The KARS gene maps to 16q23-q24 

The human KARS cDNA has been mapped to chromosome 16q22 by Nichols et al. 
(ref 2) and by us. We designed primers to the 3' untranslated region of KARS and 
used PGR to analyze the National Institute of General Medical Sciences (NIGMS) 
Human/Rodent Somatic Cell Hybrid Mapping Panel #2 (see page 704 of the NIGMS 
1994/1995 Catalog of Cell Lines, available from the Coriell Cell Repository, Camden, 
NJ). The panel consists of 24 hybrid cell lines, each monochromosomal for one human 
chromosome. The KARS PCR product mapped to the hybrid containing human 
chromosome 16. Similar results were obtained by Nichols et al., who mapped KARS 
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by analysis of the same NIGMS hybrid mapping panel, by PCR mapping in a 
chromosome 16 regional mapping panel and by fluorescence in situ hybridization to 
metaphase chromosomes. The in situ hybridization showed KARS maps to 16q23- 
q24. All mapping techniques were conclusive in showing only one locus for human 
5 KARS. 

Loss of heterozygosity occurs frequently at 16q23-q24 in many cancer types. Loss of 
heterozygosity studies of chromosome 16q have principally focused on breast and liver 
cancers. In six detailed studies of breast cancer in the 16q23-q24 region LOH 

10 frequencies of 30-60% have been reported (refs 3-8). 1 6q22 LOH has ben reported in 

35-65% of liver cancers (9-13), with the average around 45%. Studies of other cancer 
types report 16q22 LOH in 19% of colon cancers (14), 17-27% of esophageal cancers 
(15,16), 37% of ovarian cancers (new ref) (17-19), 18% of prostate cancers (20) and 
23% of uterine cancers (21). Cancer types not yet investigated for LOH include 

15 kidney, leukemia and lymphoma, lung, melanoma, neuroblastoma, stomach and testis. 
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Example 21: Ribosomal Protein S14 (RPS14) - Target Gene VARIA326 

15 

Ribosomal protein SI 4 is essential for cell growth 



Human ribosomal protein S 14 (RPS14) is one of -80 unique protein constituents of the 
mammalian ribosome. Many of the protein subunits of ribosomes, the protein making 

20 machines of all cells, are highly conserved throughout prokaryotic and eukaryotic 

evolution (1). For example, human RPS14 protein is 100% amino acid identical to 
hamster S14 protein, 72% identical to yeast rp59 protein and 43% identical to E. Coli 
ribosomal protein SI 1 (2,3). Mammalian S 14 and yeast rp59 are components of the 
40S ribosomal subunit while E. coli Sll is part of the corresponding bacterial S30 

25 subunit. Thus human RPS14 is a ribosomal component fixed early in evolution. 

There are many antibiotics and eukaryotic cell poisons that act by inhibiting ribosome 
function (reviewed in ref. 4). One such drug is emetine, which inhibits protein 
translation by interacting with the eukaryotic RPS14 subunit to prevent elongation 
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factor dependent translocation of peptidyl-tRNAs bound to eukaryotic ribosomes in 
vitro (4). 

Chinese hamster ovary (CHO) cell lines resistant to emetine have been shown to 
contain mutant RPS14 loci (also referred to as the EMTB locus) (5). Such lines have 
been used to investigate the effects of mutant RPS14 on ribosome function (5-8). 
Human-CHO cell hybrids are emetine-sensitive, indicating that the EMTB/RPS14 
mutation is recessive in CHO cells. This is apparently because arrest of protein 
synthesis in half of ribosomes blocks translation of all polysomic mRNAs by blocking 
any functional ribosomes upstream of frozen mutant ribosomes. RPS14 appears to 
contribute to the structural integrity of the 40S subunit: 40S subunits containing 
mutant S14 protein are more easily dissociable in high ionic strength wash buffers (9). 
Ribosomal subunit genes are coordinately expressed in all cells and ribosomal proteins 
constitute a large fraction of the cell mass in all cell types. 

The human RPS14 gene has sequence variances 

Rhoads et al. reported the sequence of the human RPS14 gene and cDNA (3). The 
cDNA contains a 33 nucleotide 5' untranslated region, a 453 nt coding region and a 60 
nt 3' untranslated region (including 12 nt of poly A). We undertook a systematic search 
for DNA sequence variance in the cDNA of RPS14 by analysing 36 unrelated 
individuals using the single strand conformation polymorphism technique. Primers 
were designed using the sequence of Rhoads et al. (GENBANK accession Ml 3934, 
Ml 3641; see ref. 3). SSCP analysis revealed 1 sequence variance, and subsequent 
DNA sequence analysis confirmed an A vs. G transition at nucleotide 1 83 of the coding 
sequence. (This change was noted as a difference between the cDNA and genomic 
sequences in ref. 3.) 

As shown in the Target Summary Table, both alleles were detected in all major 
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populations surveyed, including North American Whites, North American Blacks, 
Hispanics, Chinese and Japanese. 

The human RPS14 gene maps to chromosome 5q23-q33 

5 

Dana and Wasmuth (1 1) used Chinese hamster/human somatic cell hybrids to map the 
RPS14 gene (designated EMTB) to 5q23-5q35. Later Nakamichi et al. (12) placed the 
RPS14 gene on the segment 5q23-q33 using similar techniques. 

10 Chromosome band 5q23-q33 is a site of frequent loss of heterozygosity. There have 

been many studies of LOH on 5q, particularly the 5q21-q22 region where the 
Adenomatous Polyposis Coli (APC) tumor suppressor gene lies. The most extensively 
studied cancers are those of the gastrointestinal tract, lung and ovary. The available 
data on the 5q23-q33 region just distal to APC (where RPS14 lies), suggests that LOH 

15 occurs in this region at a frequency of -30% in cervical cancer (13), 20-40% in colon 

cancer (14,15), 30-50% in ovarian cancer (16,17), 38% in stomach cancer (18) and 
23% in testicular cancer (19). There is also evidence for LOH in head and neck, lung, 
and liver cancers. 
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Example 22: Eukaryotic Initiation Factor 5A (eIF-5A) - Target Gene VARIA351 

Initiation Factor 5 A is essential for cell growth 

Human Initiation Factor 5 A (eIF-5A), formerly named Initiation Factor 4D, is an 1 8- 
kD protein which promotes formation of the first peptide bond in in vitro translation 
systems - hence the name 'initiation factor' (1,2); however, the full physiological role 
of eIF-5A is not understood. Inhibition of elF 5 A formation blocks proliferation in all 
tested cell types (3); the presence of functional elF 5A has been shown to correlate 
with the onset of DNA replication (4) - perhaps due to elF 5A dependent translation 
of mRNAs encoding proteins necessary for DNA replication (3), and eIF-5A is an 
essential co-factor for HIV-1 Rev protein (5). 

elF 5A is an unusual protein: one of its lysine residues (amino acid 50) is modified by 
transfer and hydroxy lation of the butylamino-group from the polyamine spermidine to 
form hypusine, a post translational modification unique to elF 5A. All of the 
biological activities of elF 5A are abrogated in the absence of the hypusine 
modification, as demonstrated by pharmacological inhibition of hypusine formation 
in human cell lines (3) and by site directed mutagenesis of the modified lysine residue 
in the yeast enzyme (6). There are two enzymes responsible for hypusine formation, 
one of which, deoxyhypusyl hydroxylase, can be inhibited with the drug mimosine (3), 
providing a convenient pharmacological inhibitor of eFI 5A formation. 
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The genome of the yeast Saccharomyces cerevisiae encodes two elF 5A genes. 
Disruption of one (form A) slows growth, disruption of the other (form B) arrests 
growth and strains with both forms disrupted are non-viable (6). The yeast A form 
substitutes for human elF 5A in the mammalian methionyl-puromycin synthesis assay 
(6), while the human gene complements elF 5 A disrupted yeast (7). elF 5 A is a highly 
conserved protein, with counterparts in archeae, bacteria and eukaryotes. The yeast 
proteins are -63% identical to the human protein (6). 

The human elF 5 A gene and mRNA have sequence variances 

Smit-McBride, et al. reported the sequence of a human cDNA encoding eIF-5A (8) and 
Koettnitz et al. (8) later reported the sequence of the active elF 5A gene, which 
contains three introns (GenBank accession U 17969). A composite sequence made 
from the cDNA and genomic versions is 1309 nucleotides long and contains a 5' 
untranslated region of 145 nucleotides, a 462 nt coding region and a 702 nt 3' 
untranslated region (see annotated sequence). We undertook a systematic search for 
DNA sequence variance in the cDNA of elF 5A by analysing 36 unrelated individuals 
using the single strand conformation polymorphism technique. Primers were designed 
for amplification. SSCP analysis revealed 2 sequence variances, and subsequent DNA 
sequence analysis confirmed an A vs. G transition at nucleotide 623 and a T vs. C 
transition at nucleotide 1012, both in the 3* untranslated sequence. 

Neither sequence variance affects the protein coding sequence, however nucleotide 623 
is one nucleotide away from a splice acceptor site at position 622, and could therefore 
be targeted by an oligonucleotide intended to abrogate splicing in an allele specific 
manner. The second exonic nucleotide (+2 position) of a splice acceptor site is not 
highly conserved, nonetheless the A vs. G transition at nucleotide 623 may affect the 
mechanics of splicing. 
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As shown in the Target Summary Table, both alleles were detected in all major 
populations surveyed, including North American Whites, North American Blacks, 
Hispanics, Arabs, Indians and Japanese, except only the nucleotide 1012 variance was 
detected in the four Chinese surveyed. The overall frequency of heterozygotes was 
5 37% for the nucleotide 623 sequence variance and 52% for the nucleotide 1012 

sequence variance. 

The human elF 5 A gene maps to chromosome 17pl3-pl2 

10 Steinkasserer et al. (1995) mapped the elF 5A gene to 17pl3-pl2 by fluorescence in 

situ hybridization (9). Three elF 5A pseudogenes were mapped to 10q23, 17q25 and 
19ql3. 

Chromosome band 1 7 pi 3~pl2 is a site of frequent loss of heterozygosity. There have 
15 been many studies of LOH on 17p, particularly the 1 7pl3 region where the p53 tumor 

suppressor gene maps. Virtually all cancer types have been surveyed for LOH in this 
area, with particularly extensive studies of breast, colon, ovarian, and stomach cancers. 
These studies report LOH in approximately 40-60% of breast cancers (10-18), 50-70% 
of colon cancers (19-25), 25-75% of ovarian cancers (26-30), 20-60% of stomach 
20 cancers (3 1-34), 20-50% of brain cancers (35,36), 45-70% of esophageal cancers (37), 

35-65% of non-small cell lung cancers (38,39) and 100% of small cell lung cancers, 
15-50% of cervical cancers, 30-80% of head and neck cancers, 20-60% of liver 
cancers, over 50% of sarcomas and 10-30% of a variety of other cancer types. 
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Example 23: Replication Protein A, 32 kDa Subunit (RPA32) - Target Gene 
VARIA402 

The human RPA32 gene encodes a protein essential for cell survival 

Replication Protein A (RPA; also known as Replication Factor A, Activator 1, Single 
Strand Binding Protein or SSB) is a heterotrimeric protein which participates in DNA 
replication, homologous recombination and nucleotide excision repair (1-3). The 
evidence that RPA is an essential protein comes from in vitro and in vivo data. 

DNA replication is essential for cell proliferation, as discussed above for 

RPA70. 

The best studied function of RPA32 is in DNA replication. Because of the 
complexity of DNA replication in higher eukaryotic genomes, the small 
genome of the papovavirus SV40 has been used as a model system to study 
DNA replication in human cell extracts. In the 1980s several research groups 
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developed cell free systems to study DNA replication using SV40 
chromosomes as templates (4-8). An effort to identify the minimal set of 
factors required for DNA replication led to the discovery of RPA. Subsequent 
work proved that each of the three subunits of RPA is essential for DNA 
replication (9,10). This was proved in several ways, including by using 
antibodies to various constituents of the replication complex. Anti-RPA32 
antibodies inhibit DNA replication, providing clear in vitro evidence for the 
essential function of this subunit of RPA in human DNA replication (10). 
The yeast S. cerevisiae has a trimeric replication protein A which is structurally 
and functionally homologous to the human protein. It consists of three 
subunits similar in size to the human subunits. All three yeast subunits have 
been disrupted and each disruption produces non-viable yeast (9). 

The human RPA32 gene and mRNA are polymorphic. 

The published cDNA for the 32 kD subunit of Replication Protein A is 1512 
nucleotides long and includes a 5' untranslated segment of 77 nucleotides, followed by 
a protein coding region of 810 nucleotides and a 3' untranslated region of 625 
nucleotides (10). We undertook a systematic search for DNA polymorphism by 
analysing the RPA32 cDNA from 36 unrelated individuals using the single strand 
conformation polymorphism technique (described in the methods section). Primers 
were designed using the sequence of Erdile et al. (GenBank accession J05249; see ref. 
10). SSCP analysis revealed 2 variances, one of which was sequenced. Sequencing 
revealed a G vs. A transition at nucleotide 40 of the 5' untranslated region. Four of 36 
individuals were heterozygotes, all of them Caucasians. Thus the allele frequency is 
25% (4/16) in North American Whites, while no heterozygosity was detected in other 
populations (see Target Summary sheet). 

The RPA 3 2 gene maps to chromosome lp35 
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The gene for RPA32 was mapped to chromosome band lp35 by in situ hybridization, 
somatic cell hybrid analysis and yeast artificial chromosome mapping (11,12). Only 
one locus was detected by all methods. 

5 

Chromosome band lp35 is a site of frequent loss of heterozygosity. The short arm of 
chromosome 1 is comparatively well investigated for allele loss, especially in breast 
and colon cancers. Studies of the lp35 region show LOH in 15-40% of breast cancers 
(13,14), -50% of gliomas (a brain cancer subtype) (15), 20-70% of colon cancers 
10 (16,17), -50% of stomach cancers (18), -20% of lung cancers (19) and 10-30% of 

ovarian cancers. High frequency LOH has been detected in several uncommon cancers 
such as pheochromocytoma (50-80%) and neuroblastoma (-50%). 
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Example 24: Replication Protein A, 70 kD subunit (RPA70) - Target Gene 
VARIA401 

The human RPA 70 gene encodes a protein essential for cell survival 

Replication Protein A (also known as Replication Factor A, Activator or Single Strand 
Binding protein [SSB]) is a heterotrimeric protein which participates in DNA 
replication, homologous recombination and nucleotide excision repair (1-3). The 
evidence that RPA is an essential protein comes from in vitro, in vivo and evolutionary 
data. 

DNA replication is essential for cell proliferation, and a variety of 
antiproliferative drugs act, at least in part, by inhibiting DNA replication, Such 
drugs include nucleotide analogs that block DNA polymerases, such as 2',3' 
dideoxy NTPs and 3' deoxy ATP (cordycepin); inhibitors that bind to or 
modify DNA such as intercalating agents, DNA crosslinking drugs or 
alkylating agents, and inhibitors that bind to polymerases and replication 
proteins such as topoisomerase inhibitors like the epipodophyllotoxins, which 
prevent DNA unwinding necessary for replication (and transcription) and 
antibiotics which bind to polymerases such as arylhydrazino-pyrimidines. 
The best studied function of RPA70 is in DNA replication. Because of the 
complexity of DNA replication in higher eukaryotic genomes, the small 
genome of the papovavirus SV40 has been used as a model system to study 
DNA replication in human cell extracts. In the 1980s several research groups 
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developed cell free systems to study DNA replication using SV40 
chromosomes as templates (4-8). These studies, in seeking to identify the 
minimal set of factors required for DNA replication, led to the discovery of 
replication protein A. Subsequent work proved that each of the three subunits 
of RPA is essential for DNA replications. This was proved in several ways, 
including by using antibodies to various constituents of the replication 
complex. These antibodies are effectively inhibitors of RPA70. Anti-RPA70 
antibody mediated abrogation of DNA replication provides clear in vitro 
evidence for the essential function of RPA70 in human DNA replication (10). 
The yeast S. cerevisiae has a trimeric replication protein A which is structurally 
and functionally homologous to the human protein. It consists of three 
subunits similar in size to the human subunits. The yeast 70 kDa subunit is 
31% identical and 75% similar (including conserved amino acids) to its human 
counterpart (1). All three yeast subunits have been disrupted and each 
disruption produces non-viable yeast. The yeast 70 kD protein is also a single 
stranded DNA binding protein. 

Single stranded DNA binding proteins (SSBs) are required for DNA replication 
in a wide variety of organisms, including bacteriophage, bacteria and some 
DNA viruses of higher eukaryotes. Recently the crystal structure of the DNA 
binding domain of human RPA was solved and found to be remarkably similar 
in three dimensional shape to the bacteriophage single stranded DNA binding 
proteins Pf3 and gene V from fl phage. 

The human RPA 70 gene, mRNA and protein have sequence variances 

The published cDNA for the 70 kD subunit of Replication Protein A is 2393 
nucleotides long and includes a 5' untranslated segment of 69 nucleotides, followed by 
a protein coding region of 1848 nucleotides and a 3' untranslated region of 476 
nucleotides (1). We undertook a systematic search for DNA polymorphism by 
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analyzing the RPA70 cDNA from 36 unrelated individuals using the single strand 
conformation polymorphism technique (described in the methods section). Primers 
were designed using the sequence of Erdile et al. (GenBank accession M63488; see ref. 
1). SSCP analysis revealed 5 variances, and subsequent DNA sequence analysis of 
5 those variances led to identification of four additional variances. SSCP revealed the 

variances at nucleotides 81 (G vs. A), 1 120 (A vs. G), 1674 (T vs. C), 2050 (T vs. C) 
and 2297, where an insertion/deletion variance of one C nucleotide was observed (8 
v5. 9 C's in a row). In the course of sequencing around the nucleotide 2297 
polymorphism an additional variance was detected at nucleotide 2341 (A vs. G). Also, 

10 while sequencing additional Swedish individuals around nucleotide 1 120 two new 

variances were observed at nucleotides 1 124 and 125 (both C vs. T). Finally, in three 
individuals sequenced for the 2050 variance we noted a difference from the published 
sequence at nucleotide 2046: we detect 3 Ts while the published clone shows just two. 
This difference may represent another insertion/deletion polymorphism. Five of the 

15 nine detected variances are in the coding sequence while four are in the 3' untranslated 

region. 

The frequency of heterozygotes for the five SSCP positive variances ranged from 25- 
42% among the 36 individuals tested. The small number of individuals genotyped for 
the other four variances precludes definitive assessment of heterozygosity rates. Some 

20 of the polymorphisms appear to occur more commonly in certain racial or ethnic 

groups (see Target Summary sheet for details). For example, only one of the variances 
(nt 1674) was detected in Japanese individuals. In general, higher levels of 
polymorphism were detected in North American Whites than in other groups. The 
nucleotide 1120 polymorphism, for instance, was heterozygous in 9/36 individuals 

25 overall (25%), but in 8/1 6 North American Whites (50%). 



The RPA70 cDNA encodes a 616 amino acid protein. The nucleotide 1 120 and 1 124 
variances result in amino acid substitutions at residues 351 and 352, the former an 
alanine-threonine exchange (approximately 50% of Caucasians are heterozygotes) and 
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the latter a serine-phenylalanine exchange (rare in the populations tested). In the 
recently published crystal structure of the DNA binding segment of RPA70 (amino 
acids 1 8 1 -422) it is possible to place residue 35 1 in the second of two tandemly arrayed 
DNA binding domains (domain B; see ref. 10). Domain B extends from residue 1305 
5 to N402, thus the variant residue 351 is in the middle. The published structure is a co- 

crystal of RPA70 amino acids 181-422 complexed to octadeoxycytosine. Several 
RPA70 residues contact the oligonucleotide (Figure 4 of ref. 1 1), including amino 
acids K343 and T359, which lie 8 residues away from the polymorphism in either 
direction. Modeling the two variant forms of the protein using the atomic coordinates 

10 deposited in the Protein Data Bank (1JMC) should clarify the structural consequences 

of the alanine-threonine variance. Residue 35 1 lies in the center of a 50 amino acid 
segment of the protein that is relatively poorly conserved between yeast and man: 1 1 
of the 50 residues are identical and 25 more are conservative substitutions. Towards 
the C terminus there is strong conservation: starting 25 residues C-terminal of the 

15 polymorphism, 27 of the next 37 residues are identical between yeast and man. 

Towards the N terminus there is -30% conservation. Both yeast and human 70 kD 
RPA subunits contain putative C4-type zinc finger motifs at positions ^80-500. 

The RPA 70 gene maps to chromosome 1 7pl3.3 

20 

The gene for RPA70 has been mapped to chromosome band 17pl3.3 by in situ 
hybridization (12). Only one locus was detected. 

Chromosome band 1 7pl3. 3 is a site of frequent loss of heterozygosity. RPA70 lies just 
25 telomeric to the TP53 tumor suppressor gene which is located in cytogenetic band 

1 7pl 3. 1 . This region of chromosome 1 7 is extremely well investigated for allele loss. 
In general, studies report LOH in approximately 40-60% of breast cancers (13-21 ), 50- 
70% of colon cancers (22-28), 25-75% of ovarian cancers (29-33), 20-60% of stomach 
cancers (34-37), 20-50% of brain cancers (38,39), 45-70% of esophageal cancers (40), 
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35-65% of non-small cell lung cancers (41,42) and 100% of small cell lung cancers, 
15-50% of cervical cancers, 30-80% of head and neck cancers, 20-60% of liver 
cancers, over 50% of sarcomas and 10-30% of a variety of other cancer types. 

Assays developed for RPA: Protein and DNA contacts 



10 



15 



20 



25 



Human cDNAs encoding all 3 subunits (70, 34 and 1 1 kD) of RPA have been cloned 
and expressed in E. coli and in insect cells via baculovirus vectors. The bacterially 
expressed 70 kDa protein is indistinguishable from its purified human counterpart 
immunologically and in several functional assays (see Table below). There is good 
evidence that the 70 kD subunit of RPA interacts with a number of different molecules. 
A partial list would include the 34 and 1 1 kD subunits of RPA, DNA, the xeroderma 
pigmentosum damage recognition and endonuclease proteins XPA and XPG, and DNA 
polymerase a-primase. These experimentally proven contacts (and almost certainly 
others) may constrain the topology of the protein in ways that have implications for 
inhibitor design. In summary a broad array of assays exists to screen for small 
molecule inhibitors of RPA (possibly including modified nucleotides), that act via 
competitive, allosteric or protein-protein blocking mechanisms. 
Table 4 

Assays and reagents available for RPA inhibitor screening 

RPA 70 kD, Assay Systems 



30 





Purified Purified Bacterial or 




Human Protein 


Baculovirus 






Protein 


ASSAY 






Immunoreactivity 


X 


X 


Single stranded DNA binding 


X 


X 


DNA Polymerase alpha 


X 


X 


primase 
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DNA strand exchange 
Nucleotide excision repair 
Support SV40 Replication 



X 


X 


X 


X 


X 


X 
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Example 25: RNA Polymerase II, 220-kD subunit (RPOL2A) - Target Gene 
VARIA500 

The human RPOL2A gene encodes a protein essential for cell survival 
DNA-dependent RNA polymerase II (also known as RPB1 or POLR2A), a complex 
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multisubunit enzyme, is responsible for the transcription of mRNA from all protein 
coding genes. 

RNA polymerases are found in all cellular organisms. The subunit structure 
of RNA polymerases is highly conserved in eukaryotes. RNA polymerase acts 
in concert with as many as 50 other proteins in gene transcription (reviewed in 
ref. 1). See refs. 2 and 3 for a review of basal transcription by RNA 
polymerase II and recent progress in identifying and purifying transcription 
factors and cloning the genes that encode them. 

Several subunits of S. cerevisiae RPOL2A have been disrupted, always 
resulting in non-viable yeast. 

A variety of inhibitors of RNA polymerase are cytotoxic drugs, such as 
actinomycin D, which intercalates into double stranded DNA and blocks the 
movement of RNA polymerase; rifampicin binds the b subunit of E. coli RNA 
polymerase and blocks initiation of transcription. The best studied specific 
inhibitor of eukaryotic RPOL2A, however, is the potent mushroom toxin - 
amanitin, a cyclic octapeptide which binds with high affinity (Kd -10-9 M) to 
RPOL2A. Several mutations conferring resistance to a-amanitin have been 
characterized and they all map to the RPOL2A protein coding sequence. 
Recently a-amanitin binding has been shown to trigger specific degradation of 
RPOL2A (4). 

Damage to actively transcribed DNA is preferentially repaired by the 
transcription-coupled repair (TCR) system. TCR requires RNA pol II, but the 
mechanism by which repair enzymes preferentially recognize and repair DNA 
lesions on PolB II-transcribed genes is incompletely understood. 

The human RPOL2A gene and mRNA have sequence variances 

Wintzerith et al. and later Mita et al. cloned and sequenced the complete human gene 
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for RP0L2A (5, 6); the deduced amino acid sequences are identical. The RPOL2A 
gene contains 29 exons and spans about 32 kb of DNA. The cDNA sequence we 
evaluated is 6732 nucleotides long (see Annotated RPOL2A Sequence) and contains 
a 5' untranslated region of 386 nucleotides, a 5910 nucleotide coding region specifying 
1970 amino acids, and a 436 nucleotide 3* untranslated region (see annotated 
sequence). We undertook a systematic search for DNA sequence variance in the 
cDNA of RPOL2A by analyzing 36 unrelated individuals using the single strand 
conformation polymorphism technique. Primers were designed for amplification. 
SSCP analysis revealed 10 sequence variances, and subsequent DNA sequence analysis 
confirmed a G vs. A transition at nucleotide 857, a C vs. T transition at nucleotide 
1260, a C vs. T transition at nucleotide 1346, a C vs. T transition at nucleotide 1544, 
a C vs. T transition at nucleotide 1847, a C vs. T transition at nucleotide 2678, a C vs. 
T transition at nucleotide 3059, a C vs. T transition at nucleotide 3827, a T vs. C 
transition at nucleotide 6466 and a T vs. C transition at nucleotide 6557. The former 
seven sequence variances are in coding sequence and the latter two are in the 3' 
untranslated sequence. Only one of the ten sequence variances alters the protein 
coding sequence: the nucleotide 1260 alleles encode arginine (common) or cysteine 
(rare) at amino acid 292. Only 2/36 individuals surveyed are heterozygotes (6%), 
however both are North American Whites (2/16 = 12.5%) so further investigation of 
this population is required. The prevalence of heterozygotes for the other sequence 
variances varies from 3% to 50%, with 6 sequence variances above 22% (see RPOL2A 
Target Summary Sheet). The 6 common sequence variances are widely prevalent 
among all or nearly all the tested populations. 

The human RPOL2A gene maps to chromosome 17pl3A05 

The human RPOL2A gene was initially assigned to the distal portion of the short arm 
of chromosome 1 7 (1 7pter-pl2) by in situ hybridization and Southern analysis of DNA 
from human/rodent somatic cell hybrids (7, 8). Subsequent somatic cell hybrid studies 
narrowed the assignment to 17pl3.105-pl2 [vanTuinen and Ledbetter (1987)], which 
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was later confirmed by in situ hybridization to 17pl3 (9). 

Chromosome band 17pl3.1 is a site of frequent loss of heterozygosity There have 
been many studies of LOH on 17p, particularly the 1 7p 13.1 region where the p53 
5 tumor suppressor gene maps. Virtually all cancer types have been surveyed for LOH 

in this area, with particularly extensive studies of breast, colon, ovarian, and stomach 
cancers. These studies report LOH in approximately 40-60% of breast cancers (10-1 8), 
50-70% of colon cancers (19-25), 25-75% of ovarian cancers (26-30), 20-60% of 
stomach cancers (31-34), 20-50% of brain cancers (35,36), 45-70% of esophageal 
10 cancers (37), 35-65% of non-small cell lung cancers (38,39) and 100% of small cell 

lung cancers, 15-50% of cervical cancers, 30-80% of head and neck cancers, 20-60% 
of liver cancers, over 50% of sarcomas and 10-30% of a variety of other cancer types. 
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Example 26: TATA Associated Factor 30 kD subunit (TAF2H) - Target Gene 
25 VARIA 520 

The human TAF2H gene encodes a component of the transcriptional apparatus 
Transcription initiation by RNA polymerase II requires the assembly of a complex of 
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basic transcription factors which include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, 
TFIIG/TFIIJ and TFIIH/BTF2 into a preinitiation complex (1,2). TFIID is the first 
factor to contact the promoter, and subsequent assembly of the transcription complex 
is dependent on TFIID binding. TFIID is a 700-750 kD multiprotein complex which 
includes TATA binding protein (TBP) and between eight and 13 TBP-associated 
factors (TAFs) ranging from 250 to 17 kDa. The TAFs have been shown necessary to 
reconstitute activation of transcription in vitro, leading to the hypothesis that some 
TAFs link transcription activation domains to the basal transcription complex. The 
TFIID complex also supports transcription from TATA-less promoters, while TBP 
fails to do so. Therefore TAFs may also contribute to formation of stable initiation 
complexes by interacting directly with DNA (2). Conditional temperature sensitive 
Chinese hamster mutants of another TAF, TAFII250, were detected because, at the 
non-permissive temperature, DNA synthesis was inhibited leading to arrest of cell 
division at the Gl phase (3,4). Transfection of a human TAFII250 gene relieved the 
block at the non-permissive temperature. Thus an essential role has been proven for 
TAFs in mammalian cells. 

A gene (TAF2H) encoding the 30 kDa human TAF protein (TAFII30) was cloned and 
its functional properties examined by Jacq, et al. (5). The protein was shown to be 
present in a subset of TFIID complexes and to mediate transcriptional activation by a 
specific region of the estrogen receptor. Estrogen mediated transcriptional activation 
could be abrogated by adding an antibody against TAFII30. TAFII30 was not required 
for basal transcription or for transcription activation by VP- 16. It is likely that 
TAFII30 is required for transcriptional activation by a variety of other transactivating 
proteins, and is therefore essential for cell proliferation or cell survival. 
The human TAF2H gene and mRNA have sequence variants 

A human TAF2H cDNA has been cloned and sequenced (5). It encodes a cDNA of 
756 nucleotides including a 5' untranslated region of 1 7 nucleotides, a 657 nucleotide 
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coding region specifying 218 amino acids, and an 82 nucleotide 3' untranslated region 
(GenBank accession U13991; see annotated TAF2H cDNA sequence). (Note that the 
numbering of the sequence in ref. 5 differs slightly from that in the GenBank 
accession.) We undertook a systematic search for DNA variance in the cDNA of 
5 TAF2H by analysing 36 unrelated individuals using the single strand conformation 

polymorphism technique Primers were designed for amplification. SSCP analysis 
revealed 1 polymorphism, and subsequent DNA sequence analysis confirmed a G vs. 
A transition at nucleotide 554 (nt 556 of the sequence in ref. 3) of the coding sequence. 
This variance does not alter the protein coding sequence. Eight of 36 individuals 
10 surveyed are heterozygotes (22%). The variance occurs in North American Whites 

(3/16 = 19%), North American Blacks (2/4) and Hispanics (3/3). 

The human TAF2H gene maps to chromosome Ilpl5.5-pl5.2 The human TAF2H 
cDNA has been mapped to 1 lpl 5.5-pl 5.2 by fluorescent in situ hybridization (6). 
15 There appears to be a single TAF2H locus. Chromosome band Ilpl5-pl4 is a site of 

frequent loss of heterozygosity 

There have been many studies of LOH on lip, particularly the 11 pi 5 and llpl3 
segments where the Beckwith-Weidemann syndrome and WT1 genes reside. As a 
result there are many studies of LOH in llpl5.5, particularly focusing on breast, 
cervix, kidney, liver, lung, ovarian, stomach and testicular cancers. These studies show 
that the 1 lp 15.5 band of chromosome 1 1 is frequently reduced to one copy (7-24). For 
example, LOH occurs in approximately 13-33% of breast cancers (7-9), 14-42% of 
cervical cancers (10), 0-50% of liver cancers (11,12), 0-80% of lung cancers (13-15), 
18-54% of ovarian cancers (14,15), 0-71% of stomach cancers (18) and 0-50% of 
testicular cancers (19,20). Other studies show that 1 lpl5.5 LOH may also be frequent 
in bladder cancer (21), esophageal cancer (22), some leukemias (23) and sarcomas 
(24). Many deletions in the llpl5.5 region span relatively short chromosomal 
segments (2-10 megabases; see ref. 13), 
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Example 27 - cDNA synthesis 



10 



In order to analyze an essential gene for sequence variances, it is generally useful to 
have a cDNA(s) containing the coding sequence for further sequencing or 
amplification purposes. cDNAs for some genes are available, however, in some cases 
it is useful to synthesize the cDNA de novo. Methods for obtaining cDNA are known 
15 t0 those skilled in the art, as are methods for sequencing or amplifying the cDNA or 

portions thereof. An example of a useful cDNA production protocol is provided 
below, however, as recognized by those skilled in the art, other specific protocols can 
also be used. 

20 cDNA Production 

** Make sure that all tubes and pipette tips are RNase-free. (Bake them 
overnight at 1 OOoC in the vacuum oven to make them RNase-free.) 

1 Add the following to a RNase-free 0.2 ml micro-amp tube and mix gently: 
25 24 ul water (DEPC treated) 

12 ul RNA(lugAil) 

1 2 ul random hexamers(50 ng/ul) 

2 Heat the mixture to 70oC for ten minutes. 

3 Incubate on ice for 1 minute. 
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4 Add the following: 

16 ul 5 X Synthesis Buffer 

8 ul 0.1MDTT 
5 4 ul 1 0 mM dNTP mix ( 1 0 mM each dNTP) 

4 ul Superscript RT II enzyme 
Pipette gently to mix. 

5 Incubate at 42oC for 50 minutes. 

6 Heat to 70oC for ten minutes to kill the enzyme, then place it on ice. 
10 7 Add 1 60 ul of water to the reaction so that the final volume is 240 ul. 

8 Use PCR to check the quality of the cDNA. Use primer pairs that will give a 
-800 base pair long piece. See "PCR Optimization" for the PCR protocol. 

The following chart shows the reagent amounts for a 20 ul reaction, a 80 ul 
15 reaction, and a batch of 39 (which makes enough mix for 36) reactions: 



20 



25 





20 ul X 1 tube 


80 ul X 1 tube 


80ulX39 tubes 














water 


6ul 


24 ul 


936 


water 


RNA 


3ul 


12 ul 




RNA 


random hexamers 


3ul 


12 ul 


468 


random hexamers 












synthesis buffer 


4ul 


16 ul 


624 


synthesis buffer 


0.1 MDTT 


2ul 


8ul 


312 


0.1 MDTT 


lOmMdNTP 


1 ul 


4ul 


156 


lOmM dNTP 


SSRT 


1 ul 


4ul 


156 


SSRT 
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Example 28 - Variance detection by SSCP 



WO 98/41648 



PCT/US98/05419 



274 232/116 
This example describes the SSCP technique as used for the identification of sequence 
variances of the exemplary genes, which were then sequenced to confirm the specific 
base variances. One common technique currently employed in the identification of 
such single nucleotide differences is the single strand conformation polymorphism 
(SSCP) method, (originally described in Orita, et al., "Rapid and Sensitive Detection 
of Point Mutations and DNA Polymorphisms Using the Polymerase Chain Reaction, 
Genomics, 5:874-879 (1989)) Also employed are restriction fragment length 
polymorphism (RFLP), heteroduplex analysis, ligase chain reaction (LCR), denaturing 
gradient gel electrophoresis (DGGE) (Myers, Maniatis, and Lerman, Methods 
Enzymol, 155:501-527 (1987)) or direct nucleotide sequencing. A review of 
polymorphism detection techniques, including SSCP, is provided in Grompe, 1993, 
Nature Genetics 5:111-117, which includes a comparison of the commonly used 
methods. 



15 The SSCP method reveals the presence of sequence variation between individuals as 

shifts in electrophoretic mobility, but does not show the sequence itself. Direct 
sequencing of DNAs with altered mobility in the SSCP assay identifies the precise 
nucleic acid sequence differences among the various alleles. From the nucleic acid 
sequence data, the amino acid sequence can be determined. One example of the use 
20 of this technique is in Pelletier et al., Cell, 67:437-447 (1991). The single strand 

conformation polymorphism methodology is effective for scanning essential genes for 
sequence variants. It remains the standard technique in human genetics for variance 
detection, with numerous studies of its efficacy (>90%) and schemes for improved 
throughput. The SSCP method has been shown to be quite sensitive in the detection 
of single base changes, for example as shown in Ravnik-GIava et al., 1994, Human 
Mol. Genet. 3:801-807 (human cystic fibrosis gene) and Glava & Dean, 1993, Human 
Mutation 2:404-414 (mouse -globin gene). 

A flow chart of the SSCP method as used to identify essential gene sequence variants 
is shown in Fig. 2 (SSCP OVERVIEW). The method involves the steps of 1) PCR 
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amplifying a portion of an essential gene cDNA of known sequence (labeled products), 
2) selecting restriction enzymes which will produce fragments approximately 100-400 
bases in length for 3 independent digestions of the PCR products, 3) heat denaturing 
the digestion products, 4) running single strand digestion products on non-denaturing 
gels, 5) identifying bands having different mobilities when compared between 
individuals, thereby identifying potential sequence variants, 6) sequence at least the 
region around the potential sequence variance, that region being identified by 
comparison of the expected fragment sizes resulting from the digestions, 7) record the 
specific location and base identity of the confirmed sequence variant, 8) calculate the 
percent occurrence of each sequence variance for the gene as found for the sample of 
the population. The method is further described in Example 2. 

Single strand conformation polymorphism screening is a widely used technique for 
identifying an discriminating DNA fragments which differ from each other by as little 
as a single nucleotide. As originally developed by Orita (supra), the technique was 
used on genomic DNA, however the same group showed that the technique works very 
well on PCR amplified DNA as well. In the last 8 years the technique has been used 
in hundreds of published papers, and the modifications of the technique have been 
described in dozens of papers. The enduring popularity of the technique is due to (1) 
a high degree of sensitivity to single base differences (>90%) (2) a high degree of 
selectivity, measured as a low frequency of false positives, and (3) technical ease. 
SSCP is almost always used together with DNA sequencing because SSCP does not 
directly provide the sequence basis of differential fragment mobility. The basic steps 
of the SSCP procedure are described below and summarized in Fig. 2 in flow chart 
form. 

Because the intent of our SSCP screening was to identify as many target gene 
variances as practically possible, we developed a protocol designed to look at a 
relatively large number of individuals (36) with a high degree of redundancy, so as to 
minimize both the false negative and false positive rates. 
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The 36 individuals examined are reasonably representative of most of the 
worlds major populations. The racial or geographic origin of the 36 cell lines is 
detailed in the Target Summary Tables (Figure 5). All cell lines are EBV immortalized 
lyphoblastoid cells obtained from the Coriell Cell Repository (Camden, NJ), which 
5 includes the racial/ethnic/geographic background of cell line donors in its catalog. The 

cell lines were also selected for their rapid growth rates. In several cases a panel of 
cDNAs isolated from French Canadians was used instead, or in addition to, the Coriell 
panel. 



10 SSCP was used to analyze cDNAs (rather than genomic DNAs) because in many cases 

the full genomic sequence of the target gene is not available, however, the technique 
is also applicable to genomic sequences. To produce cDNA requires RNA. Therefore 
each of the 36 cell lines was grown to mass culture and RNA was isolated using the 
acid/phenol protocol, sold in kit form as TRIAZOL™ by Life Technologies 

15 (Gaithersberg, MD). The unfractionated RNA was used to produce cDNA by the 

action of a modified Maloney Murine Leukemia Virus Reverse Transcriptase, 
purchased in kit form from Life Technologies (SUPERSCRIPT II™ kit). The reverse 
transcriptase was primed with random hexamer primers to initiate cDNA synthesis 
along the whole length of the RNAs. This proved useful later in obtaining good PCR 

20 products from the 5' ends of some genes. 

Material for SSCP analysis was prepared by PCR amplification of the cDNA in the 
presence of one 32 P labeled dNTP (usually 32 PdCTP). Usually the concentration of 
nonradioactive dCTP was dropped from 200 uM (the standard concentration for all 
four dNTPs) to about 100 uM, and 32 P dCTP was added to a concentration of about 

25 0.1-0.3 uM. This involved adding a 0.3- 1 ul (3-10 uCi) of 32 P cCTP to a 10 ul PCR 

reaction. All radioactivity was purchased from DuPont/New England Nuclear. 



The customary practice is to amplify about 200 base pair PCR products for SSCP, 
however, we found that it was preferable to amplify about 0.8-1.4 kb fragments and 
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then use several cocktails of restriction endonucleases to digest those into smaller 
fragments of about 0. l-0.4kb, aiming to have as many fragments as possible between 
.15 and .3 kb. The digestion strategy had the advantage that less PCR was required, 
reducing both time and costs. Also, we routinely performed three different digests on 
each sample (for all 36 cDNAs), and then ran each of the digests separately on SSCP 
gels. This had the effect of increasing the redundancy of our method, lessening both 
the false negative and false positive rates. For example; a site of variance might lie 
within 2 bases of the end of a fragment in one digest, and as a result not affect the 
conformation of that strand; the same variance, in a second or third digest, would likely 
lie in a location more prone to affect strand folding, and therefore be detected by SSCP. 

After digestion, the radiolabeled PCR products were diluted 1 :5 by adding formamide 
load buffer (80% formamide, IX SSCP gel buffer) and then denatured by heating to 
90%C for 10 minutes, and then allowed to renature by quickly chilling on ice. This 
procedure (both the dilution and the quick chilling) promotes intra- (rather than inter-) 
strand association and secondary structure formation. The secondary structure of the 
single strands influences their mobility on nondenaturing gels, presumably by 
influencing the number of collisions between the molecule and the gel matrix (i.e., gel 
sieving). Even single base differences consistently produce changes in intrastrand 
folding sufficient to register as mobility differences on SSCP. 

The single strands were then resolved on two gels, one a 5.5% acrylamide, 0.5X TBE 
gel, the other an 8% acrylamide, 10% glycerol, IX TTE gel. The use of two gels 
provides a greater opportunity to recognize mobility differences. Both glycerol and 
acrylamide concentration have been shown to influence SSCP performance. The gel 
apparatus we use (from Owl Scientific, MA) allows 108 samples to be loaded per gel. 
Since all 36 samples are routinely digested with three different endonuclease mixes 
there are 108 samples to be analyzed for each PCR product. By routinely analyzing 
three different digests under two gel conditions (effectively 6 conditions), and by 
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looking at both strands under all 6 conditions, we achieve a 12-fold sampling of each 
base pair of cDNA. 



All of the sequence variances described in this disclosure were determined by DNA 
5 cycle sequencing of 32 P labeled PCR products using the femtomole DNA cycle 

sequencing kit from Promega (WI) and the instructions provided with the kit. 
Fragments were selected for DNA sequencing based on their behavior in the SSCP 
assay. 



10 

Example 29 - Variance detection by using T4 endonuclease VII mismatch 
cleavage method 



The enzyme T4 endonuclease VII is derived from the bacteriophage T4. T4 
15 endonuclease VII is used by the bacteriophage to cleave branched DNA intermediates 

which form during replication so the DNA can be processed and packaged. T4 
endonuclease can also recognize and cleave heteroduplex DNA containing single base 
mismatches as well as deletions and insertions. This activity of the T4 endonuclease 
VII enzyme can be exploited to detect sequence variances present in the general 
20 population. 



The following are the major steps involved in identifying sequence variations in a 
candidate gene by T4 endonuclease VII mismatch cleavage: 

1 . Amplification by the polymerase chain reaction (PCR) of 400-600 bp regions 
of the candidate gene from a panel of DNA samples The DNA samples can 
either be cDNA or genomic DNA and will represent some cross section of the 
world population. 

2. Mixing of a fluorescently labeled probe DNA with the sample DNA. Heating 
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and cooling the mixtures causing heteroduplex formation between the probe 
DNA and the sample DNA. 

3. Addition of T4 endonuclease VII to the heteroduplex DNA samples. T4 
endonuclease will recognize and cleave at sequence variance mismatches 
formed in the heteroduplex DNA. 

4. Electrophoresis of the cleaved fragments on an ABI sequencer to determine the 
site of cleavage. 

5. Sequencing of a subset of PCR fragments identified by T4 endonuclease VI to 
contain variances to establish the specific base variation at that location. 

A more detailed description of the procedure is as follows: 



A candidate gene sequence is downloaded from an appropriate database. Primers for 
PCR amplification are designed which will result in the target sequence being divided 
15 into amplification products of between 400 and 600 bp. There will be a minimum of 

a 50 bp of overlap not including the primer sequences between the 5 1 and 3' ends of 
adjacent fragments to ensure the detection of variances which are located close to one 
of the primers. 



20 Optimal PCR conditions for each of the primer pairs is determined experimentally. 

Parameters including but not limited to annealing temperature, pH, MgCl 2 
concentration, and KC1 concentration will be varied until conditions for optimal PCR 
amplification are established. The PCR conditions derived for each primer pair is then 
used to amplify a panel of DNA samples (cDNA or genomic DNA) which is chosen 

25 to best represent the various ethnic backgrounds of the world population or some 

designated subset of that population. 

One of the DNA samples is chosen to be used as a probe. The same PCR conditions 
used to amplify the panel are used to amplify the probe DNA. However, a 
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flourescently labeled nucleotide is included in the deoxy-nucleotide mix so that a 
percentage of the incorporated nucleotides will be fluorescently labeled. 

The labeled probe is mixed with the corresponding PCR products from each of the 
DNA samples and then heated and cooled rapidly. This allows the formation of 
heteroduplexes between the probe and the PCR fragments from each of the DNA 
samples. T4 endonuclease VII is added directly to these reactions and allowed to 
incubate for 30 min. at 37 C. 10 ul of the Formamide loading buffer is added directly 
to each of the samples and then denatured by heating and cooling. A portion of each 
of these samples is electrophoresed on an ABI 377 sequencer. If there is a sequence 
variance between the probe DNA and the sample DNA a mismatch will be present in 
the heteroduplex fragment formed. The enzyme T4 endonuclease VII will recognize 
the mismatch and cleave at the site of the mismatch. This will result in the appearance 
of two peaks corresponding to the two cleavage products when run on the ABI 377 
sequencer. 

Fragments identified as containing sequencing variances are subsequently sequenced 
using conventional methods to establish the exact location and sequence variance. 

Example 30 - Identification of Sequence Variances by Informatics-based analysis 
of gene-sequence databases 

In addition to and/or in conjunction with the molecular biology based approaches for 
identifying sequence variances in genes, particularly in essential genes, such sequence 
variances can be identified by analysis of public and/or private genetic sequence 
databases. Such information can be either genomic or cDNA sequence information. 

The data base analysis process includes the following major steps: 
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1. 

1. capture of homologous sequences of a particular gene from data bases. It is 
preferable to obtain a large number of independent sequences of a particular 
gene 

2. analysis of collected sequences of a particular gene to identify authentic 
sequence variances. This step involves the discrimination of authentic 
sequence variances, which are sequence variances which actually exist in the 
population, from sequencing errors and artifacts. It is expected that about 0. 1 - 
0.3% of the bases will occur as true variances, while the frequency of 
sequencing artifacts is expected to be 1-3%. This discrimination utilizes the 
expected frequencies of occurrence of specific types of nucleotide sequence 
changes. Such information includes the characteristic frequency of specific 
transitions and transversions and of the characteristic frequency of deletions 
and insertions in authentic variations. It uses the frequency of occurrence of 
known types of sequencing artifacts such as single base insertions or deletions 
adjacent to repeated C or G nucleotides. Additional information for such 
discrimination is provided if particular putative authentic variations are 
observed in multiple independently derived sequences of the gene. 



An implementation of this sequence variance identification process utilizes a reference 
sequence of an essential gene. Preferably, the reference sequence is a high quality 
sequence, meaning that there is a low frequency of occurrence of sequencing errors or 
artifacts. The second step is the retrieval of allelic sequences of that essential gene 
from available databases such as the BLAST server, the UNIGENE database, or other 
such sequence database. Such allelic sequences need not be complete, but are 
preferably long enough to ensure that they are in fact allelic sequences. The third step 
involves alignment analysis to identify and tabulate sequence differences between the 
different available sequences. An algorithm for such analysis is the Smith- Waterman 
local alignment algorithm. Use of an algorithm of this type involves a series of pair- 
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wise alignments of each retrieved sequence with the reference sequence. The fourth 
step involves analysis of the observed sequence differences and assignment of a 
probability that each sequence difference represents an authentic variance. This 
analysis utilizes program filters which are combined in a weighted fashion to determine 

5 a final probability . Such program filters include comparison of the observed 

difference with common mutational changes and sequencing errors, a weighting of the 
reliability of a particular retrieved sequence based on the total number of differences 
observed, a weighting based on the location within a retrieved sequence where a 
change was observed and a significant weighting based on the observance of a 

0 particular difference in multiple independently derived retrieved sequences. 

Using such an implementation, a database analysis with respect to a particular 
reference sequence produces a list of putative authentic sequence variances and a 
probability for each of those variances that the sequence difference is an authentic 
5 variance. As described above, the probability is obtained through the use of a series 

of weighted program filters and thus these filters are modified to produce optimal 
authentic variance discrimination. 



Example 31 - Antiproliferative effects of variance specific inhibition of RPA70 

This example describes experiments showing the practicality and utility of variance- 
specific inhibition of essential genes for cancer therapy. Specifically, this example 
describes in vitro experiments showing the design and production of variance-specific 
oligonucleotides for antisense inhibition of variant alleles of the essential Replication 
Protein A, 70 kDa subunit (RPA70) for inhibition of RPA70 mRNA, and the use of 
these oligonucleotides to inhibit cell proliferation and to reduce the number of cells in 
a variance-specific manner. 

Variance-specific inhibition and cell killing with antisense oligonucleotides against 
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RPA70 

These experiments with RPA70 illustrate the feasibility of each of the steps for 
development of a variance specific inhibitor: 

Select candidate target gene essential for cell survival or proliferation. As described 
above, RPA is essential for replication in prokaryotic and eukaryotic cells, 
mitochondria, phage, viruses and in in vitro (SV40) replication systems. The protein 
is a heterotrimer required for loading DNA polymerase onto the DNA template during 
cell replication. The 70 kDa subunit, RPA70, is a single strand binding protein that 
mediates the interaction of RPA with DNA. Without this protein, the replication 
complex does not associate with DNA and the replication of DNA does not occur. 

Confirm chromosome location and LOH frequency. RPA70 is encoded by a single 
gene locus on chromosome 17pl3.3, immediately adjacent to the p53 gene at 17pl3.1. 
LOH involving chromosome band 17pl3.3 has been documented in 50-70% of colon, 
lung, breast, and ovarian cancers. LOH at this locus also occurs in other cancers. The 
inventor as confirmed LOH involving RPA 70 in breast, colon, lung and other cancers. 

Identify common variances in the normal population. We have identified five common 
variances in the RPA70 gene (Figure 8). The most common occurs in 42% of the 
normal population. One variance alters the amino acid sequence and is present in 25% 
of the normal population (44% of Caucasians). This variance occurs within the active 
DNA binding domain (discussed below). These variances are described in the 
description above and in Fig. 1 . 

Demonstrate antiproliferative effects due to inhibition of candidate gene. The inventor 
has shown that inhibition of RPA70 in T24 bladder carcinoma cells with an antisense 
oligonucleotide reduces cell number. This effect is comparable to treatment of these 
cells with antisense oligonucleotide against ras, previously shown to have antitumor 
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effects in vitro and in vivo (Figure 9). 

Design variance-specific inhibitor. Variance specific antisense oligonucleotides were 
designed to differentially inhibit the two variant forms of RPA70. Experiments were 
performed using tumor cell lines that are homozygous for each form of the target gene. 
Figure 10 shows inhibition of mRNA levels in Mia Paca II cells by the 13085 
oligonucleotide which matches the variance in these cells. In contrast, in T24 cells 
(and A549 cells, see below) the 12781 oligonucleotide matches the target gene and 
inhibits mRNA levels. In both cell lines neither the control oligonucleotide differing 
by one base (13085 in T24 cells and 12781 in Mia Paca II cells) nor a random- 
sequence oligonucleotide control (13706) inhibit mRNA levels to the same extent as 
the matched oligonucleotide. 

Figure 10 demonstrates that the RPA 70 mRNA can be specifically down regulated in 
an allele-specific manner. However, the 13085 oligomer used also has a small effect 
on the level of the unmatched RNA. In order to increase the discrimination we altered 
the structure of the targeting oligomer, 13085. The results are shown in Figure 1 1. By 
shortening the oligomer we retain its ability to down-regulate its matched target RNA 
(Mia Paca II cells, right half of Figure 11). Strikingly, however, this alteration 
dramatically altered the ability of this oligomer to down-regulate the mismatched 
variant RNA T24 cells, left half of Figure 1 1 . The reciprocal regulation by oligomer 
1 278 1 was augmented by altering transfection conditions. These data suggest that even 
simple changes to the rudimentary "first generation" chemistry and transfection 
techniques can have significant effects in enhancing the ability of the oligomers to 
recognize and down regulate specific mRNAs. 

Achieve variance-specific antiproliferative effects in cancer cells. Cell proliferation 
in each cell line, determined by BrdU incorporation, was suppressed to a greater degree 
by the matched oligonucleotide than by the controls differing by one base (Figure 12). 
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Cell proliferation in A549 cells was inhibited by oligomer 12781 to a greater degree 
than by oligomer 13085. Cell proliferation in Mia Paca 1 1 cells was inhibited more 
by oligomer 13085. 

Additional studies were performed to characterize the antiproliferative effect in A549 
cells (12781 genotype). A dose response curve demonstrates inhibition of BrdU 
incorporation by the matched oligonucleotide (12781) at concentrations 8-fold lower 
than the oligonucleotide with one base mismatch (13085) (Figure 13). 

Cell survival was measured by staining cells with Sulforhodamine B dye 72 hours after 
treatment with oligonucleotides. Dose dependent reductions in cell number were 
observed in cells treated with the matched oligonucleotide (12781) but not with an 
oligonucleotide containing the one base mismatch (13085) (Figure 14). In contrast, in 
Mia Paca II cells, more cell killing was observed with the 13085 oligonucleotide than 
with the 12781 oligonucleotide (Figure 15). The oligonucleotides used in these studies 
have not been optimized for achieving allele-specific effects. Oligonucleotides using 
advanced chemistries can be utilized to optimize the potency and provide greater 
discrimination between variant targets at lower levels. 

Example 32 - variance specific inhibition of essential genes 

This example describes experiments showing the practicality and utility of variance- 
specific inhibition of essential genes for cancer therapy including RNA Pol II, and 
ribonucleotide reductase. Specifically, this example describes in vitro experiments 
showing the design and production of variance-specific oligonucleotides for antisense 
inhibition of variant alleles of the essential Ribonuclotide Reductase (RR), the design 
and production of variance-specific oligonucleotides against RR, and the use of these 
oligonucleotides to inhibit RR mRNA in a variance-specific manner. 
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Variance-specific inhibition of Ribonucleotide Reductase. 

Ribonucleotide Reductase (RR) is an essential gene of nucleoside metabolism. 
Inhibitors of this function are known to be cell lethal. Two variances were discovered 
5 at position 24 1 0 and 24 1 9. Oligonucleotides were synthesized to a sequence spanning 

these two variations. In one case the oligomer targeted the GnnnnnnnnA variation 
(oligomer Varia 2410GA or RR2410GA) and in the other case the oligomer targeted 
the AnnnnnnnnG variant (oligomer Varia 2410AG or RR2410AG). In Mia Paca II 
cells which contain the GnnnnnnnnA variance, the RR2410GA antisense oligomer 

10 dramatically knocked down the level of RR mRNA. However, the oligomer targeting 

the other variance, oligomer Varia 2410AG, had little to no effect on the level of 
mRNA (Figure 16). The reciprocal regulation was demonstrated in MDA-MB 468 
cells which express the other variance, AnnnnnnnnG (Figure 1 7). In these cells Varia 
24 10 AG dramatically lowered the level of RR mRNA. In contrast, Varia 2410GA had 

15 no effect on the level of mRNA. These data taken together, are another example of 

allele-specific targeting of gene expression. We are also determining the effect of 
down regulating RR gene expression on cellular growth. 



Example 33 - variance specific inhibition of essential genes using advanced 
oligonucleotide chemistries. 

This example describes experiments showing the practicality and utility of variance- 
specific inhibition of essential genes for cancer therapy. Specifically, this example 
describes in vitro experiments showing the design and production of variance-specific 
oligonucleotides for antisense inhibition of variant alleles of the essential 
Glutamyl/prolyl tRNA Synthetase (EPRS), the design and production of variance- 
specific oligonucleotides against EPRS, and the use of these oligonucleotides to inhibit 
EPRS mRNA in a variance-specific manner. 
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Glutamyl-prolyl-tRNA synthetase (EPRS) is an essential gene, required for the 
synthesis of both glutamic acid tRNA and proline tRNA. Without EPRS protein 
synthesis is blocked. Two variances were discovered in this gene at positions 2963 and 
2969 in the cDNA. We have demonstrated variance-specific inhibition of this gene 
5 with antisense oligonucleotides exploiting several different types of chemistry. 

The experiments described above with RPA70 and RR utilized phophorothioate 
chemistry. This chemistry was developed to achieve greater stability in vivo, and this 
compound ha been used in several successful clinical trials. Phosphorothioates, 

0 however have low affinity for the RNA target, and, consequently, relatively lower 

specificity. We have achieved improved variance-specific inhibition using alternative 
chemistries. Specifically, we have synthesized hybrid oligonucleotides that contain 
both phosphorothioate and nucleotides with higher affinities. These hybrids contain 
"wings" consisting of six nucleotides with a 2' sugar modification (ethoxy-methoxy 

5 radical at the 2' position) and either a phosphorothioate or phosphodiester backbone. 

Between the "wings" is a 8 nucleotide sequence of phosphorothioates that overlaps the 
variance. (In these constructs the 5' position of cytosine has been methylated.) As 
shown in Figure 18, variance specific inhibition is observed with the conventional 
phorphorothioates. Greater inhibition of target mRNA is observed using the hybrid 

0 chemistries at lower doses. Inhibition by the matched hybrid oligomer, 14977, occurs 

at approximately 50-100 nM. The effect is extremely oligomer-specific. The 
mismatched oligomer, 14971, has no effect on mRNA levels at concentrations as high 
as 400 nM (Figure 19). 

5 

Example 34 - in vivo cancer therapy using oligonucleotides 

This example describes reported in vitro and in vivo data on the treatment of cancer in 
animal models using antisense oligonucleotides against c-raf, showing the expected 
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correlation between in vitro suppression of mRNA and cell proliferation with 
oligonucleotides, and in vivo anticancer activity. 

In vitro evidence for inhibition of mRNA by antisense oligonucleotides and inhibition 
5 of cell proliferation is commonly used to predict in vivo effects on tumors. This is 

exemplified by the publication by Monia et al (Nature Medicine, Volume 2 Number 
6, June 1 996) who demonstrated anticancer effects using oligonucleotides against C-raf 
kinase. In vitro treatment of human tumor cells with appropriate phosphorothioate 
antisense oligomers led to specific inhibition of C-raf kinase gene expression and 

10 subsequent decrease in cellular proliferation, IC50=50- 1 OOnM. Administration of C- 

raf antisense oligomers to nude mice having a tumor burden derived from these cells 
significantly inhibited tumor growth in vivo f IC50= 0.06-0.6 mg/kg. Remarkably, the 
investigators were able to show that the anti-C-raf oligomers down-regulated the level 
of C-raf kinase mRNA in vivo by assaying mRNA levels in cells removed from the 

15 tumor. 



Example 35 - in vivo cancer therapy by oligonucleotide inhibition of ras 

This example describes reported in vivo data showing an anticancer effect using an 
allele-specific inhibitor for suppression of mutant H-ras. Schwab et al (Proc. Nat. 
Acad Sci. USA 91:10460-464, Oct 1994) demonstrated antitumor effects of an 
antisense oligonucleotide specific for the mutant ras in animal models. In these 
experiments HBL100 cells were transformed with the RAS oncogene. In vitro studies 
demonstrated that the RAS mRNA could be specifically down-regulated by a 
nanoparticle conjugated phosphodiester antisense oligomer. Only the transforming 
RAS mRNA was targeted by the oligomer. The normal cellular RAS mRNA, differing 
by a single base, was not affected by the antisense oligomer. The decrease in RAS 
expression was associated with a decrease in the growth rate of the cells. The 
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transformed HBL100 cells were injected into nude mice to form tumors; following 
subcutaneous injection of nanoparticle-conjugated phosphodiester antisense oligomers, 
Schwab et al measured both a decrease in targeted tumor weight and volume. 
Specificity for tumor cell growth correlated well with the in vitro data having a 5-fold 
differential between antisense and control groups. 

The authors of this paper are proceeding with clinical trial of these oligonucleotides for 
the treatment of cancer, demonstrating the potential clinical utility of these methods. 



Example 36. Variance detection by DGGE 



This example describes denaturing gradient gel electrophoresis (DGGE), a technique 
used for the identification of DNA sequence variances in genomic DNA, cDNA or in 

15 PCR products amplified from genomic DNA or cDNA. The DGGE method was 

originally described by Fischer and Lerman (Two Dimensional Electrophoretic 
Separation of Restriction Enzyme Fragments of DNA. Methods in Enzymology, vol. 
68: 183-191, 1979; DNA Fragments Differing by Single Base-Pair Substitutions are 
Separated in Denaturing Gradient Gels: Correspondence with Melting Theory. Proc. 

20 Natl. Acad. Sci. U.S.A. 80: 1 579, 1 983) and has been improved since then by many 

investigators. See, for example: Myers, et al., Mutation Detection by PCR, GC- 
Clamps, and Denaturing Gradient Gel Electrophoresis, pp. 71-88 in Erlich, H.A., 
editor: PCR TechnolPKv; Principles and Applications for DNA Ampiifirati™ 
Stockton Press, New York, 1989; Myers, et al., Detecting Changes in DNA: 

25 Ribonuclease Cleavage and Denaturing Gradient Gel Electrophoresis, in Davies, K.E., 

editor: Genomi c Analysis: A Practical Approach . IRL Press Ltd., Oxford, 1988, pp. 95- 
139; E.S. Abrams and V.P. Stanton Jr., Use of Denaturing Gradient Gel 
Electrophoresis, pp. 71-104 in Lilley, D.M.J, and Dahlberg, J.E., editors: DNA 
Structures , pan B: Chemical and Electronhoretic Analy s is of DNA , Methods in 
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Enzymology, volume 212, Academic Press, 1992; .) Descriptions of current 
applications of the technique can be found in 

The basic principal of DGGE involves the creation of a gradient of denaturant in a gel, 
which is then used to resolve double stranded DNA (or RNA) fragments on the basis 
of conformational differences associated with strand melting. The denaturant can be 
chemical (as in DGGE, where a gradient of formamide and urea is typically used) or 
thermal (as in a related technique called thermal gradient gel electrophoresis, or TGGE, 
where a gradient of heat is used). To obtain conditions where double stranded DNA 
is close to melting, DGGE gels are immersed in a heated bath of electrophoresis buffer, 
while TGGE gels have a fixed concentration of chemical denaturant. 

As a double stranded DNA molecule migrates through a DGGE gel from a low 
concetration of denaturant at the origin to higher concentrations of denaturant toward 
the end of the gel it eventually reaches a level of denaturant that will cause partial 
melting. (Some design of DNA molecules is often necessary to assure that the partial 
melting will occur as desired; see below.) The concentration of denaturant required to 
melt a given DNA segment is highly sensitive to sequence differences in the DNA, 
including changes as subtle as a single nucleotide substitution. Partially melted DNA 
fragments move through gels at a much slower rates than their fully duplex 
counterparts. Thus two DNA fragments differing at a single nucleotide can be 
distinguished on the basis of their gel position after an appropriate period of 
electrophoresis: the fragment with the more stable structure (resulting from, for 
example, a G:C base pair in place of an A:T pair) will travel further in the gel than its 
less stable counterpart, because it will encounter the concentration of gradient required 
to melt it (and consequently dramatically retard or nearly stop its movement) at a point 
further along in the gel. 

The DGGE method reveals the presence of sequence variation between individuals as 
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shifts in electrophoretic mobility, but does not show the sequence itself. Direct 
sequencing of DNA fragments (from different individuals) with altered mobility in the 
DGGE assay will reveal the precise sequence differences among them (see example 
37, Variance Detection by DNA Sequencing). From the nucleic acid sequence data, 
the amino acid sequence can be determined and any amino acid differences can be 
identified. 



The DGGE method is suitable for analysis of restriction enzyme digested genomic 
DNAs, as initially described by Lerman and co-workers (supra) and later extended 
(Gray, M. Detection of DNA Sequence Polymorphisms in Human Genomic DNA by 
Denaturing Gradient Blots, American Journal of Human Genetics, 50: 331-346, 1992). 
DGGE is equally suitable for analysis of cloned DNA fragments or DNA fragments 
produced by PCR. The analysis of cloned fragments or PCR fragments has the 
advantage that non-natural sequences, rich in G and C nucleotides can easily be added 
to the 5' ends (either flanking the cloning site or at the 5' ends of PCR primers). Such 
DNA fragments have very stable double stranded segments, called GC clamps, at one 
or both ends. The GC clamps alter the melting properties of the fragments, and can be 
designed so as to insure melting of the inter-primer segment of the PCR product at a 
lower temperature than the clamps, thereby optimizing the detection of sequence 
differences (see Myers et alia, supra and Myers et alia, Nearly All Single Base 
Substitutions in DNA Fragments Joined to a GC Clamp Can be Detected by 
Denaturing Gradient Gel Electrophoresis. Nucleic Acids Research 13: 3131, 1985). 
GC clamps can be rationally designed for any specific DNA fragment of known 
sequence by use of a computer program (MELT87, written by L. Lerman) that 
accurately predicts melting behavior based on analysis of primary sequence. When GC 
clamps are used correctly, the DGGE method is highly efficient at detecting DNA 
sequence differences. Not only are nearly 100% of differences detected, but the false 
positive rate is essentially zero. (Abrams, E.S., et alia, Comprehensive Detection of 
Single Base Changes in Human Genomic DNA Using Denaturing Gradient Gel 
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Electrophoresis and a GC Clamp. Genomics 7: 463-475, 1990.) Recently methods for 
increasing the throughput of DGGE have been developed, based on multiplex PCR. 

The steps in carrying out DGGE with GC clamps are: 

5 

1. Design DNA fragments with optimal melting behavior. Select oligonucleotide 
primers, using GC clamps as necessary, to produce a single melting domain over the 
length of the sequence to be analyzed. (It may be necessary to divide the sequence into 
overlapping fragments to achieve this goal.) Design of primers and simulated analysis 

10 of fragments can be performed with the computer program described by Lerman. 

(Lerman, L.S. and Silverstein, K. Computational Simulation of DNA Melting and its 
Application to Denaturing Gradient Gel Electrophoresis. Methods in Enzymology 
155: 482-501, 1987.) The output of the program is the melting map of the fragment, 
from which it will also be possible to determine the optimal range of denaturant in the 

15 gradient and the approximate electrophoresis time for fragments to reach the point of 

melting in the gradient. 

2. Amplify the fragment by PCR. Procedures for optimizing PCR are briefly described 
in other examples and are well known in the art. Template DNA samples can either be 
cDNA or genomic DNA and will typically be drawn from a panel of unrelated 

20 individuals. 



3. Pour a denaturing gradient gel Briefly, make up two gel solutions containing the 
desired beginning and end concentrations of denaturant. The gel solutions are 
generally made up by mixing "0%" and "100%" denaturant stock solutions, where the 
0% stock consists of 7% acrylamide in Tris-acetate EDTA (TAE) electrophoresis 
buffer, and the 100% stock is also 7% acrylamide in TAE, plus 40% formamide by 
volume and 7 molar urea. Equal volumes of the two solutions (e.g. twelve milliliters 
of each solution) are poured into the two chambers of a gradient maker (usually 
between 20 and 40% denaturant in the upstream chamber and 60 to 80% in the lower 
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one) immediately after addition of ammonium persulfate and TEMED for acrylamide 
polymerization. Open the stopcock of the gradient maker and pour the gradient gel. 
Usually gels are .75 to 1 mm in thickness, and gel combs that form 10-30 wells are 
used. With commercially available apparatus multiple gradient gels can be poured 
simultaneously. Suitable apparatus is sold by several vendors, including the BioRad 
(Hercules, CA) Dcode system and the C.B.S. Scientific DGGE system. 

4. Place the gel in a heated bath of electrophoresis buffer. Gels are electrophoresed 
at elevated temperature which, together with the denaturant, brings the DNA fragments 
to their melting point. Gels are often run at 60°C in IX TAE buffer, with constant 
recirculation of buffer to the upper buffer chamber. Once the gel has been placed 
in the heated tank and allowed to equilibrate it can be loaded. Multiple gels can be 
run simultaneously in the same tank with the apparatus listed above. 

5. Load and run gel. Usually enough PCR product from each sample is loaded on 
the gel so that samples can be detected by a simple DNA staining procedure; use of 
radioactivity, dyes or hybridization procedures can thereby be avoided. At least 100 
mg of each sample should be loaded, but preferably over 200 ng. Gel running 
conditions can be estimated from the output of the MELT87 program, however 
empirical adjustment will often be necessary. Usually a voltage of - 80 to 200V is 
applied for periods of 5-20 hours, depending on the characteristics of the fragments 
being analyzed. 

6. Stain and analyze gel After electrophoresis gels are stained with ethidium 
bromide, SYBR Green, silver or some other procedure. The location of PCR 
products produced with the same primer pairs should be compared. Altered 
location, and usually the appearance of two or more bands instead of one, signify the 
presence of DNA sequence differences. (The reason for more than two bands from 
a diploid sample is that during the terminal cycle of heating and cooling of the PCR 
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step heteroduplexes are formed between the maternally and paternally inherited 
alleles. If those alleles differ in sequence, the heteroduplexes will have mispaired 
nucleotides at the sites of difference. As a result the heteroduplexes will be less 
stable than either of the homoduplex species, and will consequently melt and be 
5 retarded in the gel at a lower concentration of denaturant. Altogether one may see 

four bands in such samples: two reciprocol heteroduplexes and two homoduplexes.) 
The specific pattern of fragments in each lane constitutes a signature for a specific 
nucleotide change. 

10 7 - Sequence DNA fragments with altered mobility. Examples of all different 

signatures should next be analyzed by DNA sequencing to identify the base 
difference(s) accounting for altered mobility in the gradient gel. See example 37 for 
a description of this procedure and the subsequent steps of recording the sequence 
variances and analyzing their frequency and structural and functional consequences. 

15 

Example 37: Variance detection by sequencing. 

Sequencing by the Sanger dideoxy method or the Maxim Gilbert chemical cleavage 
20 method is widely used to determine the nucleotide sequence of genes. Presently, a 

worldwide effort is being put forward to sequence the entire human genome. The 
Human Genome Project as it is called has already resulted in the identification and 
sequencing of many new human genes. Sequencing can not only be used to identify 
new genes, but can also be used to identify variations between individuals in the 
25 sequence of those genes. 

The following are the major steps involved in identifying sequence variations in a 
candidate gene by sequencing: 
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1 . Amplification by the polymerase chain reaction (PCR) of 400-700 bp regions 
of the candidate gene from a panel of DN A samples The DNA samples can 
either be cDNA or genomic DNA and will represent some cross section of 
the world population. 

2. Sequencing of the resulting PCR fragments using the Sanger dideoxy method. 
Sequencing reactions are performed using flourescently labeled dideoxy 
terminators and electrophoresedon an ABI 377 sequencer or its equivalent. 

3. Analysis of the resulting data from the ABI 377 sequencer using software 
programs designed to identify sequence variations between the different 
samples analyzed. 

A more detailed description of the procedure is as follows: 

A candidate gene sequence is downloaded from an appropriate database. Primers for 
PCR amplification are designed which will result in the target sequence being 
divided into amplification products of between 400 and 700 bp. There will be a 
minimum of a 50 bp of overlap not including the primer sequences between the 5' 
and 3' ends of adjacent fragments to ensure the detection of variances which are 
located close to one of the primers. 

Optimal PCR conditions for each of the primer pairs is determined experimentally. 
Parameters including but not limited to annealing temperature, pH, MgCl 2 
concentration, and KC1 concentration will be varied until conditions for optimal PCR 
amplification are established. The PCR conditions derived for each primer pair is 
then used to amplify a panel of DNA samples (cDNA or genomic DNA) which is 
chosen to best represent the various ethnic backgrounds of the world population or 
some designated subset of that population. 

PCR reactions are purified using the QIAquick 8 PCR purification kit (Qiagen cat# 
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28142) to remove nucleotides, proteins and buffers. The PCR reactions are mixed 
with 5 volumes of Buffer PB and applied to the wells of the QIAquick strips. The 
liquid is pulled through the strips by applying a vacuum. The wells are then washed 
two times with 1 ml of buffer PE and allowed to dry for 5 minutes under vacuum. 
The PCR products are eluted from the strips using 60 ul of elution buffer. 

The purified PCR fragments are sequenced in both directions using the Perkin Elmer 
ABI Prism™ Big Dye™ terminator Cycle Sequencing Ready Reaction Kit (Cat# 
4303150). The following sequencing reaction is set up: 8.0 ul Terminator Ready 
Reaction Mix, 6.0 ul of purified PCR fragment, 20 picomoles of primer, deionized 
water to 20 ul. The reactions are run through the following cycles 25 times: 96°C 
for 10 second, annealing temperature for that particular PCR product for 5 seconds, 
60°C for 4 minutes. 

The above sequencing reactions are ethanol precipitated directly in the PCR plate, 
washed with 70% ethanol, and brought up in a volume of 6 ul of formamide dye. 
The reactions are heated to 90°C for 2 minutes and then quickly cooled to 4°C. 1 ul 
of each sequencing reaction is then loaded and run on an ABI 377 sequencer. 

The output for the ABI sequencer appears as a series of peaks where each of the 
different nucleotides, A, C, G, and T appear as a different color. The nucleotide at 
each position in the sequence is determined by the most prominent peak at each 
location. Comparison of each of the sequencing outputs for each sample can be 
examined using software programs to determine the presence of a variance in the 
sequence. One example of heterozygote detection using sequencing with dye labeled 
terminators is described in Pui-Yan Kwok et. al (Pui-Yan Kwok, Christopher Carlson, 
Thomas D. Yager, Wendy Ankener, and Deborah A. Nickerson, Genomics 23, 138- 
144 (1994)). The software compares each of the normalized peaks between all the 
samples base by base and looks for a 40% decrease in peak height and the concomitant 
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appearance of a new peak underneath. Possible variances flagged by the software are 
further analyzed visually to confirm their validity 



Example 38. Loss of heterozygosity. 

Loss of chromosomes or segments of chromosomes in disease cells results in loss of 
alleles in the disease cells compared to normal diploid cells. Such allele losses are a 
common occurrence in cancer, where they have been documented in over 1,500 
publications in the past 14 years. More recent work has documented the occurrence 
of allele loss in other proliferative diseases. Several cytogenetic and molecular 
techniques have been developed to measure chromosome losses. The molecular 
techniques are preferable for identification of allele loss because they also show which 
allele is lost, and are therefore best suited to provide the information needed to 
implement the present invention. 

In order to measure chromosome loss using molecular techniques it is necessary to be 
able to distinguish the paternally and maternally inherited copies of a given 
chromosome. DNA variances allow the two copies of a given chromosome to be 
distinguished because different alleles can be resolved electrophoretically. The 
standard method for analyzing allele loss in cancer is to compare tumor cell DNA with 
normal cell DNA, either in a Southern blot or using PCR based techniques. A patient's 
tumor DNA is said to be "informative" for allele loss only at loci where the patient's 
normal cells are heterozygous. When such heterozygous loci are examined in tumor 
cells often only one allele is detected. Such tumor cells have lost the heterozygous 
state which characterizes all normal somatic cells of the patient, hence the term loss of 
heterozygosity (LOH). 
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Several effective molecular procedures have been developed to measure LOH. These 
procedures have been applied most extensively to cancer tissues, however the same 
methods are effective in the study of nonmalignant diseases such as atherosclerotic 
plaques and endometriosis. The main steps are: 

1 . Identify DNA variances at or near the locus to be investigated for LOH. 

LOH usually affects large segments of DNA, ranging from several megabases to an 
entire chromosome. As a result, accurate estimation of LOH at a specific locus can be 
obtained by measuring the frequency of LOH at neighboring polymorphic markers on 
the same chromosome, or more preferably on the same chromosome arm, or most 
preferably within several 10-20 megabases of the locus. However, to precisely 
measure LOH at a specific locus requires a variance at the locus. Different types of 
variances have been used to study LOH, including single nucleotide polymophisms 
(SNPs), specifically SNPs that alter restriction endonuclease cleavage sites, called 
RFLPs. (For details of this approach see Vogelstein, B., et al., Allelotype of colorectal 
carcinomas. Science 244: 207-211, 1989). Also short tandem repeat polymorphisms 
(STRPs), including di-, tri- and tetranucleotide repeat polymorphisms have been used 
to measure LOH. (For details of this procedure see Jones and Nakamura, Deletion 
Mapping of Chromosome 3p in Female Genital Tract Malignancies Using 
Microsatellite Polymorphisms. Oncogene 7: 1631-1634, 1992.) Procedures for 
identifying variances are described in Examples 28, 29, 30 and 36. 

2. Prepare DNA from paired normal and disease tissue samples from patients being 
studied. 

Before preparing genomic DNA from tumor tissue it is important to assess tumor cell 
purity and viability, using microscopic examination of frozen sections if necessary. 
If embedded pathological specimens are being analyzed tumor cell purity can be 
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assessed by examining histologic sections before selecting areas for cell isolation and 
DNA purification. (See Johnson, et al., Direct Molecular Analysis of Archival Tumor 
Tissue for Loss of Heterozygosity, BioTechniques 19:190-191, 1995, and references 
therein for description of techniques for purifying tumor cell DNA from archival 
5 pathology samples.) Areas of necrosis and extensive admixture of normal and tumor 

tissue should be avoided. For Southern blotting -5- 1 0 ug of genomic DNA is required 
for each sample being analyzed. For PCR based methods as little as 5 to 10 ng of 
genomic DNA is sufficient; much less will suffice if two successive rounds of PCR 
amplification are used. 

10 3. Determine genotype in the normal and disease tissues using a quantitative or semi- 

quantitative procedure that allows the amount of each allele to be measured. Compare 
the ratio of alleles in the normal tissue to the ratio in the tumor tissue 



In order to show LOH at a given locus it is necessary to establish that the patient is 
15 constitutionally heterozygous at the locus. Thus DNA from normal tissue must be 

tested, either before or in parallel with tumor tissue DNA. A variety of methods can 
be used for quantitation of signal from the two alleles. If the alleles are compared on 
a Southern blot then signal in the bands corresponding to the two alleles can be 
counted by radioactive or nonradioactive techniques (see Ausubel, et al., Current 
20 Protocols in Molecular Biology, John Wiley & Sons). One method employs phosphor 

technology using a Molecular Dynamics Phosphorlmager with ImageQuant software 
to measure signals. If the alleles are compared after PCR amplification then DNA 
sequencing can provide accurate quantitation of allele ratios. See, for example, 
Goldsborough and Komberg, Allele-Specific Quantification of Drosophila Engrailed 
25 and Invected Transcripts, Proc. Natl. Acad. Sci. U.S.A. 91 : 12696-12700, 1994. 

Using highly variable markers distributed across the genome a comprehensive map of 
LOH can be assembled for a specific cancer type. Such data sets have been termed 
allelotypes. Separate studies are necessary for different cancer (or other disease) types 
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as the patterns of LOH differ significantly in different diseases. 

Other techniques that have been used to detect allele loss in cancer include 
Comparative Genomic Hybridization (CGH) and Representation Difference Analysis 
5 (RD A) however these methods are more complex than the Southern blot or PCR based 

techniques. Chromosome loss can also be detected cytogenetically. Mitelman 
(Catalog of Chromosome Aberrations in Cancer. Wiley-Liss, New York, 1995.) has 
compiled a catalog of over 10,000 published karyotypes of cancer cells which 
documents chromosome deletions as well as other changes. 

10 



Example 39. Small molecule inhibitors of variant sequences: 
Methylguanine Methyltransferase (MGMT) 
Gene VARIA 1534 



The methylguanine methyltransferase gene is essential for cell growth or survival in 
the presence of alkylating agents 

20 Methylguanine methyltransferase (MGMT) is a nuclear protein that repairs alkylating 

agent damage, specifically alkylation of the 06 position of guanine bases in genomic 
DNA. MGMT acts as a suicide protein in removing methyl or alkyl groups from 
guanine and covalently binding them to cysteine 145 of MGMT. The protein is 
subsequently degraded; it does not act as an enzyme. 06-benzylguanine is an inhibitor 

25 of MGMT that mimics the natural substrate, alkylated DNA; transfer of the benzyl 

group to cysteine 145 of MGMT inactivates the protein. Concurrent administration of 
06-benzylguanine and an alkylating agent such as carmustine (BCNU) or lomustine 
(CCNU) renders tumor cells more sensitive to the toxic effects of the nitrosoureas by 
inactivataing MGMT and thereby inhibiting the tumor cells ability to repair alkylated 
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DNA. MGMT is thus a conditionally essential gene in the presence of nitrosoureas and 
other alkylating agents. The conditional essentiality of MGMT has been demonstrated 
in mice. Animals homozygous for disrupted MGMT genes are more than ten times as 
sensitive to alkylating agents as normal mice. The relative sensitivity has been 
5 measured as the LD50, the dose required to kill 50% of treated animals. (Tsuzuki, T., 

et al. Targeted disruption of the DNA repair methyl transferase gene renders mice 
hypersensitive to alkylating agent. Carcinogenesis 17: 1215-1220, 1996.) 06- 
benzylguanine is being developed as a chemosensitizing agent (with alkylating agents) 
for treatment of human cancer. This treatment regimen is not specific for cancer cells. 

10 

In a cancer patient with two alternative functional MGMT alleles in normal tissues and 
LOH at 10q23 resulting in only one copy of MGMT in cancer cells, an allele specific 
inhibitor of MGMT could be used to specifically sensitize cancer cells to the action of 
alkylating agents. Treatment would consist of the administration of the appropriate 

15 allele specific inhibitor (directed to the one allele remaining in cancer cells) plus an 

alkylating agent. The tumor cells would be unable to effectively repair the alkylating 
agent induced DNA damage, while the uninhibited allele in normal cells would be able 
to function. Thus normal cells, including sensitive normal cell populations such as 
bone marrow stem cells, would be able to tolerate higher doses of alkylating agents 

20 than cancer cells. 



The MGMT gene and encoded protein are polymorphic 

Four variances in human MGMT have been discovered by the inventors or reported in 
25 the literature, including three variances that affect the protein sequence. There is a C/T 

variance at nucleotide 255 (1 1% heterozygotes among 36 individuals surveyed) which 
does not affect the encoded protein. There is a second C/T variance at nt. 346 which 
results in a L84F amino acid variance (5% heterozygotes among 36 individuals 
surveyed). There is an A/G variance at nt. 523 which results in a 1 143V amino acid 
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variance (24% heterozygotes among 36 individuals surveyed). This variance occurs 
only two residues from the active site cysteine at 145. A fourth variance, G/A has been 
reported in the Japanese population at codon 160, GGA vs. AGA, resulting in a glycine 
vs. argtnine amino acid variance. Fifteen percent of 40 Japanese individuals studied 
5 were heterozygotes for this variance. (Imai, Y., et al. A polymorphism at codon 160 

of human 06-methylguanine-DNA methyltransferase gene in young patients with adult 
type cancers and functional assay. Carcinogenesis [London] 16:2441-24445, 1995.) 

Allele specific inhibitors of MGMT 

Two of the amino acid variances in MGMT, at residues 143 and 160, are near the 
active site of the protein. Substantial work has already been done to characterize the 
functional consequences of the residue 160 glycine/arginine variance. Studies of 
MGMT kinetics and activity have shown that the 160arginine allele is at least 20 fold 
more resistant to 06 benzylguanine inactivation, measured as an increase in the ED50 
and or as a reduction in the production of guanine from 06-benzyl[8-3H] guanine. The 
160gly and 160arg forms of MGMT were nearly equal in alkyltransferase activity in 
an assay that measured repair of 06-methylguanine in methylated DNA. These results 
demonstrate variance-specific effects of a small molecule, 06-benzylguanine, on 
normal (non-mutant) alleles of the conditionally essential MGMT gene. (Edara, S., et 
al. Resistance of the human 06-alkylguanine-DNA alkyltransferase containing arginine 
at codon 160 to inactivation by 06-benzylguanine. Cancer Research 56: 5571-5575, 
1996) 

25 Administration of 06-benzylguanine to patients who are heterozygous for the variance 

in their normal cells, and contain only the alternative form of the gene with a glycine 
residue at position 160 in their cancer cells, together with methylating or 
chloroethylating agents, will specifically sensitize cancer cells to the cytotoxic effects 
of the alkylating agents without increasing toxicity to normal cells which, since they 



10 



15 



20 
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contain the 06-benzylguanine resistant 160arginine form of the protein, will continue 
to repair alkylated DNA. 



There is no published data concerning the residue 143 variance, however the 
5 proximity of this variance to the active site - both in the primary sequence and upon 

inspection of the three dimensional structure of the bacterial AGT protein, a 
functional and structural homolog of human MGMT - suggests that allele specific 
drugs could be discovered for this variance. 



10 The structural difference between 143isoleucine and 143valine is a hydrophobic 

methyl group. It is well known that most small molecule protein inhibitors interact 
via hydrophobic interactions. Favorable Van der Waals distances between 
hydrophobic groups of a substrate and a ligand are vital for high affinity interaction. 
One possible mechanism of allele specific inhibition would be to exploit the greater 

15 bulk of the isoleucine by finding a small molecule that fits into the active site 

pocket of the valine allele but has a very unfavorable Van der Waals interaction the 
methyl group of the isoleucine. Other schemes based on the different size and 
geometry of isoleucine and valine could also be effective. 



20 One approach to identification of such inhibitors would be to make small molecule 

libraries in which various positions of guanine are substituted with moities of 
appropriate size and structure. Such libraries could then be tested in various screens 
of MGMT activity. The two alleles (143isoleucine and 143valine, or any of the 
other allele pairs of MGMT described above) would be assayed in parallel. 

25 Identification of molecules with allele specific inhibitory activity could be the basis 

for synthesis of additional libraries in which the moities that are best correlated with 
differential activity are further varied. Methods for the iterative design of high 
affinity or highly discriminating small molecule inhibitors are known in the art. 
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Libraries of restricted size can be screened for allele specific inhibitors using a 
combinatorial strategy based on known inhibitors of MGMT such as 06-benzyl- 
guanine. A library or libraries can be constructed in which substitutions are 
indroduced at positions C6 and N9 which have previouly been found to affect 
5 inactivation of MGMT, or at positions C2 and N8 which can be easily substituted. 

For example a series of 4(6)-(benzyloxy)-2,6(4)-diamino-5-(nitro or 
nitroso)pyrimidine derivatives and analogs in which 4(6)-benzyIoxy groups were 
replaced with (2-, 3-, or 4 fluorobenzyl)oxy or (2-, 3-, or 4-pyridylmethyl)oxy 
groups have been synthesized and tested for MGMT inhibition. (Terashima I., and 
10 K. Kohda. Inhibition of human 06-alkylguanine-DNA alkyltransferase and 

potentiation of the cytotoxicity of chloroethylnitrosourea by 4(6)-(Benzyloxy)- 
2,6(4)-diamino-5-(nitro or nitroso)pyrimidine derivatives and analogues. J Med 
Chem 41 : 503-508, 1998.) Substitutions at N7 have been found to be detrimental 
in general (Moschel, R.C. et al & Pegg, A. E., J. Med Chem. 35: 4486-4491, 1992). 

15 

Combinatorial libraries can be constructed according to a published procedure 
(Norman, T. C. et al., A Structure-Based Library Approach to Kinase Inhibitors. J, 
Am. Chem.Soc. 1 18: 7430-7431, 1996) where guanine based libraries were made by 
anchoring a chemically modified guanine (at C6, C2, or C8) to solid supports at C2 
20 via a glycinamide linkage or at N9 via a hydroxyethyl linkage. Chemical reactions 

can be carried out to introduce a library of hydrophobic substituents of different size 
at positions C6, C2, or C8. Hydrophobic substituents of various bulkiness and 
orientation can be indroduced through derivatives of 06-benzyl and 06-phenyl 
groups, 06-alkyl groups, N9-alkyl groups, and C2-amino-alkyl groups. 

25 

Libraries constructed as above can be screened for MGMT activity in several types 
of assays. Methods for bacterial expression and purification of human MGMT 
protein have been described (see Edara, et al., cited above). Both allelic forms of 
MGMT could be screened for repair of alkylated or methylated DNA by measuring 
30 transfer of tritium from a tritium labelled (methylated) DNA substrate in the 
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presence of various concentrations of library compounds for various times. 
Alternatively, library compounds could be tritiated and MGMT proteins could be 
screened for the rate at which they interact with (either via association or cleavage 
of a moiety from the compound). Other assays for MGMT activity are known in 
5 the art. 



Example 41. Clinical use of variance specific inhibitors for treating cancer 

10 Inhibitors that are the object of the present invention are designed to be 

administered to patients who are heterozygous for the target gene, meaning that 
their cells normally contain two alternative copies of the gene, one that is sensitive 
to inhibition by said inhibitors, and one that is not sensitive to said inhibitors. It is 
apparent that several such inhibitors may be developed according to this invention 
15 targeted to alternative alleles of a single target gene or to several different target 

genes. The inventors propose that a series of such inhibitors will be developed 
according to this invention. 

The clinical use of this invention involves the steps of: 

(a) testing normal cells from a patient to identify target genes that are heterozygous, 
present in two alternative forms. 

(b) testing biopsy tissue from a tumor or proliferative lesion to determine whether 
one of the two alternative forms is eliminated due to LOH. 

(c) selecting a drug for inhibition based on the presence of the sensitive allele in the 
tumor and the presence of an insensitive allele in normal cells 

(d) administering said drug to the patient in an appropriate dose to inhibit the 
essential function in the cancer cell. 

Testing of normal cells to identify heterozygosity of the target gene is performed 



20 



25 
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using conventional diagnostic methods that are known in the art. Normal cells are 
commonly derived from a blood sample, hair sample, or buccal smear. 
Alternatively normal cells may be obtained by cultivating primary cells such as 
lymphoblasts or fibroblasts in vitro. The presence of two alternative alleles may be 

5 determined by methods including allele-specific hybridization with oligonucleotides 

containing the variant sequences and a number of non-variant nucleotides to allow 
differential binding to the alternative forms of the gene or other methods known in 
the art using purified DNA or RNA or amplified DNA or cDNA sequences. 
Testing of biopsy tissue is performed by separating tumor cells or cells of the 

10 proliferative lesion to isolate a sample of cells characteristic of the proliferative 

lesion for analysis. This is performed by a variety of methods known in the art 
including manual dissection or laser assisted methods for eliminating normal cells 
or selecting abnormal cells. Samples of abnormal tissue, and samples of normal 
tissue as a control, are analyzed to identify the presence or absence of alternative 

15 forms of the target gene. The presence of two altrnative alleles may be determined 

by methods including allele-specific hybridization with oligonucleotides containing 
the variant sequences and a number of non- variant nucleotides to allow differential 
binding to the alternative forms of the gene or other methods known in the art using 
purified DNA or RNA or amplified DNA or cDNA sequences. 

20 

Selection of a drug for administration will be based on clinical trial data indicating 
that the drug is effective in eliminating abnormally proliferating cells and causing 
an improvement in the patient's clinical condition for patients who have the 
sensitive allele of the target gene in their pathological lesion. In one aspect of this 
25 invention, the product label will describe that the drug is indicated in patients who 

have only a specific allele of the target gene in their lesion and an alternative allele 
in their normal cells. Any such drug will be indicated only for a fraction of patients 
having two alternative alleles of the target gene in their normal cells and LOH. The 
fraction of patients who may be treated with any one drug may be determined by 
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multiplying the number of patients with a given cancer times the fraction of tumors 
exhibiting LOH of the target gene locus times the fraction of patients who will be 
heterozygous. For a target gene exhibiting 50% heterozygosity in the population 
and a 70% fraction of LOH in a specific cancer (several such examples are shown), 
a single inhibitor will treat -17% of such cancers. A second compound directed 
against the alternative allele would treat another 17% of said cancer. In the 
preferred use of this invention, a panel of such drugs will be available enabling 
therapy with at least one such drug in most patients. 

Administration of the drug to the patient ration to the patient will involve 
conventional means such as parenteral, oral, or intratumoral administration. The 
route of administration will be determined separately for each inhibitor and will be 
based on the bioavailability of the compound to the lesion. The compound may be 
administered in one or more doses as a single agent or in combination with other 
allele specific agents or conventional antiproliferative drugs or agents commonly 
used for the treatment of cancer or support of cancer patients. 



Example 42.Cell Division Cycle 25C (CDC25C) - Gene VARIA10 

Cdc25C is essential for cell growth 

A vital regulator of cell proliferation is the protein kinase Cdc2, whose activation at 
the end of G2 of the cell cycle initiates mitosis. Gene disruption experiments in 
yeast confirm the importance of this protein, as cells lacking Cdc2 fail to progress 
through the cell cycle. As would be expected for such an important protein, Cdc2 
activity is tightly regulated. Its activity depends on complex formation with Cyclin 
B, a protein that accumulates through the cell cycle and is then abruptly degraded 
during mitosis. Phosphorylation of Cdc2 on Tyr-15 and Thr-14 by the Weel/Mikl 
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kinases maintains the Cdc2/Cyclin B complex in an inactive state until the end of 
G2. The dual-specificity phosphatase Cdc25C is then stimulated to 
dephosphorylate Cdc2 on both residues, resulting in activation of the complex. Just 
as Cdc2 is essential for cell growth, the regulation of its activity is essential. The 
best evidence for this is that the individual disruption of cdc2, cyclin B, wee 1 and 
cdc25 in the yeast S. pombe are lethal events. When cdc25 is deleted from these 
cells they display a phenotype consistent with their function; they grow without 
dividing, becoming dramatically elongated. 

The human CDC25C gene and protein have variances 

The CDC25C cDNA was cloned by Sadhu et ah (1) (Genbank accession number 
M34065, GI number 181075). To determine whether CDC25 is polymorphic, 
VARIAGENICS scanned cDNA from 32 unrelated individuals using the T4 
Endonuclease VII method, which involves the cleavage of DNA heteroduplexes 
followed by DNA sequencing of polymorphic regions (see description of method in 
examples). A transversion at nucleotide 1099 (G or C) was identified (nucleotide 
numbering is from reference 1). This results in an amino acid difference at residue 
297, with G encoding glycine and C encoding arginine. Overall, 9.4% of 
individuals analyzed are heterozygous. The rate of heterozygosity increases to 
33.3% in Caucasians. 

The human CDC25C gene maps to chromosome 5q31, a site of frequent loss of 
heterozygosity 

Sartor et al (2) mapped the human CDC25 gene to 5q3 1 by fluorescence in situ 
hybridization using the cDNA cloned by Sadhu et al This mapping location was 
confirmed by Taviaux and Demaille (3), also using fluorescence in situ 
hybridization. There have been many studies of LOH on 5q, particularly the 5q21- 
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q22 region where the Adenomatous Polyposis Coli (APC) tumor suppressor gene 
lies. The most extensively studied cancers are those of the gastrointestinal tract, 
lung and ovary. There have been fewer studies of the 5q23-q33 region just distal to 
APC (where CDC25C lies), however the available data suggests that LOH occurs in 

5 this region at a frequency of -30% in cervical cancer (4), 20-40% in colon cancer 

(5,6), 30-50% in ovarian cancer (7,8), up to 38% in stomach cancer (9), and 23% in 
testicular cancer (10). There is also evidence for LOH in head and neck, lung and 
liver cancers. In most of these studies only one or two markers were used. 
Definitive assessment of LOH frequency at the CDC25C locus will require direct 

10 analysis of the polymorphisms identified in various tumor types. 
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Example 43. Dihydropyrimidine Dehydrogenase (DPD) 

10 DPD is conditionally essential 

Dihydropyrimidine Dehydrogenase is essential for cell survival in the presence of 
pyrimidine nucleotide analogs such as 5-FU and fluorodeoxyuridine. 5-fluorouracil 
(5-FU) and related compounds are antineoplastic drugs used in the treatment of 
15 breast, gastrointestinal, head and neck and other cancers. These drugs have widely 

varying clinical effects in cancer patients, ranging from induction of complete 
response (tumor disappearance) in some patients to severe toxicity in others. There 
is currently no reliable basis for predicting individual patient responses, and 
therefore patients receiving 5-FU must be monitored carefully for toxic reactions. 

20 

There are a variety of anabolic and catabolic pathways that affect the action of 5-FU 
(reviewed in Goodman and Gilman, The Pharmacological Basis of Therapeutics, 
8th edition). For example, in order to exert its antiproliferative effects the 
pyrimidine analog 5-FU must be converted enzymatically to the nucleotide level 
25 (fluorodeoxyuridine) by phosphorylation and ribosylation; fluorodeoxyuridine is 

sometimes given directly because it bypasses most of these steps, and simply 
requires phosphorylation by thymidine kinase. The 5-fluoronucleotide is an 
irreversible inhibitor of thymidylate synthase, the enzyme which converts dUMP to 
dTMP and is required for de novo synthesis of thymidine, and hence for DNA 
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synthesis. 

There is a three step pathway for catabolism of pyrimidines (thymine and uracil) to 
beta alanine. Pyrimidine analogs such as 5-FU are catabolized by the same 
pathway. The first and rate limiting step in this pathway is catalyzed by 
dihydropyrimidine dehyrogenase (DPD). DPD accounts for catabolism of as much 
as 90% of a 5-FU dose in normal individuals, and the half life of 5-FU in normals is 
-8-20 minutes. Patients homozygous for mutant DPD alleles have been identified, 
a condition variously called DPD Deficiency, Hereditary Thymine-Uraciluria or 
Familial Pyrimidinemia. In such patients -90% of 5-FU is excreted unchanged in 
the urine, and the drug has a half life longer that 2.5 hours. As a result of the 
drastically reduced catabolism of 5-FU the toxic effects of the drug are magnified 
and patients are subject to severe toxic reactions. There are reports of deaths in 
patients with DPD deficiency after treatment with 5-FU. Thus cell (and organism) 
survival in the presence of 5-FU depends on presence of functional DPD protein to 
transform 5-FU to the inactive dihydroxy metabolite. 

This principal has also been demonstrated in cancer cells both in vitro and in vivo: 
cancer cells with lower DPD levels are more susceptible to the toxic effects of 
5-FU. It has been suggested that measuring DPD levels would be useful for 
calibration of 5-FU dosage. 

The DPD gene exhibits variances 

We have identified four common sites of variance in DPD mRNA by screening 
cDNA from 36 unrelated individuals. The variant nucleotides are 166, 577, 3925 
and 3937 (see DPD Variance Table; numbering is from Yokota, et al. cDNA 
Cloning and Chromosome Mapping of Human Dihydropyrimidine Dehydrogenase, 
an Enzyme Associated with 5-fluorouracil Toxicity and Congenital Thymine 
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Uraciluria. Journal of Biological Chemistry. 269:23192-23196,1994). Two of the 
variances in nucleotide sequence alter the amino acid coding sequence: amino acid 
29 is usually cysteine but arginine alleles were also detected; cys/arg heterozygotes 
were found at a frequency of 1 1%. Residue 166 of DPD is reported to be 

5 methionine but valine is present at 166 in some alleles; 9% of the population 

surveyed are met/val heterozygotes. One double heterozygote was identified out of 
36 patients. Both these amino acid polymorphisms are located in the N-terminal 
NAD/FAD binding domain of DPD. Residue 166 is located in a highly conserved 
domain of DPD. Two other polymorphisms are located in the 3' untranslated region 

10 of DPD, only 1 1 nucleotides apart. 

The DPD gene maps to chromosome lp22 t a region frequently subject to LOH in 
different cancers 

The DPD gene has been mapped to chromosome lp22 by fluorescense in situ 
15 hybridization. LOH at lp22 has been reported in colon, breast, and other cancers. 

Allele specific inhibition of DPD to potentiate 5-FU action in cancer cells with 
LOH at the DPD locus 



20 The DPD gene is polymorphic and conditionally essential in the presence of 5-FU. 

These properties can be exploited in a therapeutic strategy for cancer patients with 
LOH at the DPD locus. Specifically, in a patient with two alternative alleles for 
DPD in normal cells and one allele in cancer cells due to LOH, an allele specific 
drug can be used to sensitize cancer cells to the action of 5-FU by inhibiting its 

25 catabolism. Cancer cells (but not normal cells) would be poisoned by high levels of 

5-FU due to low clearance. Normal cells, containing an uninhibited allele, would be 
able to catabolize DPD at close to normal levels. 



Alternatively, patients heterozygous for functional and defective copies of DPD, 
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and in whom LOH resulted in loss of the functional allele, could be treated by 5-FU 
without the necessity for an allele specific inhibitor. Identification of such patients 
would require a test for heterozygosity at DPD and a test for LOH which could 
show which allele is deleted in cancer cells. Such an approach would be expected 
to identify patients likely to respond well to 5-FU even though they might have 
cancers not traditionally treated with pyrimidine analogs. 

Example 44. Fanconi Anemia genes A, B, C, D, E, F, G and H (FAA, FAB, 
FAC, FAD, FAE, FAF, FAG, FAH) 

The Fanconi Anemia genes are conditionally essential 

The Fanconi Anemia genes are essential for cell growth or survival in the presence 
of DNA cross linking agents. In order for cells to survive or proliferate in an 
abnormal environment characterized by the presence of DNA cross linking 
molecules such as Mitomycin C and diepoxybutane it is necessary that the cells are 
capable of efficiently repairing damage caused by these agents. Cells contain 
proteins necessary for such repair. One way such repair proteins can be identified is 
by absence of function in specific patients who, as a consequence, are particularly 
susceptible to the toxic effects of cross linking agents. 

Fanconi Anemia (FA) is a hereditary disease, autosomal recessive in transmission, 
characterized by progressive bone marrow failure, birth defects and predisposition 
to malignancies. FA patients are hypersensitive to the toxicity of DNA cross 
linking agents. This hypersensitivity can be measured in cultured FA cells, which is 
one method used to establish the diagnosis of FA. 

Patients heterozygous for defective FA genes are generally not hypersensitive to 
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10 



DNA crosslinking agents in contrast to those that are homozygous. This suggests 
that treating heterozygous cancer patients with an inhibitor specific for one allele of 
the FA gene (and thereby reducing levels of FA protein function by up to 50% in 
normal cells) would be well tolerated. Inhibition of the FA allele present in cancer 
cells but not the alternative form present only in normal cells would make cancer 
cells selectively sensitive to crosslinking agents, leading to a cytotoxic 
antiproliferative effect. Normal cells would be able to repair damage caused by 
such agents, by analogy to the clinical data from patients heterozygous for defective 
FA genes. 

The FA genes and gene products are polymorphic 



15 



20 



25 



Seven FA genes have been identified by complementation studies. The genes for 
FAA and FAC have been cloned. DNA variances have been reported in both genes. 
For example, Savino et al. report three variances in FAA, all of which alter the 
protein coding sequence. (Savino, M., et al. Mutations in the Fanconi Anemia 
Group A Gene (FAA) in Italian Patients. American Journal of Human Genetics 
61 : 1246- 1253, 1997.) The location of these variances is shown in the Table below, 
reproduced from the paper by Savino. 

Variances in the FAA Gene 



Polymorphic 


Alternate 


Affected amino 


Alternate 


Frequency of 


nucleotide 


bases 


acid residue 


amino acids 


rare allele 


796 


A, G 


266 


Thr,Ala 


.29 


1501 


G, A 


501 


Gly, Ser 


.40 


2426 


G, A 


809 


Gly, Asp 


.30 



30 



FA genes map to chromosomes that are frequently subject to LOH in different cancers 
The FAC gene maps to chromosome 9q22.3, (as do three other FA complementation 
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groups according to Strathdee, C.A., et al. Evidence for at least four Fanconi anaemia 
genes including FACC on chromosome 9. Nature Genetics 1: 196-198, 1992), The 
FAA gene maps to chromosome 16q24.3. FAD maps to 3p26-p22. All FA genes 
mapped so far lie in regions subject to frequent LOH. LOH affecting chromosome 9 
5 is well documented in many cancers. For example, loss of the 9q arm is well 

documented in cancers such as bladder, esophagus, ovary, testis and uterus. LOH 
frequencies in these cancers range from 20% to 62%. LOH affecting chromosome arm 
16q, particularly the 16q24 region is well documented, particularly in breast, prostate 
and liver cancers. For example, in six detailed studies of breast cancer in the 

10 16q22-q24 region LOH frequencies of 40-60% have been reported. Further, 16q22 

LOH has been reported in 25-90% of liver cancers, with the average around 45%. Less 
extensive studies of other cancer types report 16q22 LOH in 19% of bladder cancers, 
20% of colon cancers, 19-27% of esophageal cancers, 25% of small cell lung cancers, 
16-37% of ovarian cancers 22% of uterine cancers, and 31-50% of prostate cancers. 

15 Loss of chromosome 3p26-21 is common in lung cancer, kidney cancer, head and neck 

cancer and breast cancer among other cancers. Reports of >50% LOH are common in 
these cancer types. 

Other genes conditionally essential for response to DNA cross linking agents 

20 

In a related aspect, other genes which, when defective, sensitize cells to toxic effects 
of DNA crosslinking agents would be amenable to the therapeutic strategy outlined 
above for the FA genes. Specifically, in a patient with two alternative alleles for such 
a gene and LOH at the relevant locus, an allele specific drug could be used to sensitize 
25 cancer cells to the action of cross linking agents. Such drugs could then be used to 

treat cancer patients constitutionally heterozygous for two normal alleles at the relevant 
locus, in whom LOH had rendered cancer cells hemizygous or homozygous for one 
allele. Treatment would consist in the administration of the appropriate allele specific 
inhibitor plus a cross linking agent or treatment to induce damage in all cells. Cancer 
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cells (but not normal cells) would be rendered unable to respond by inhibition of 
expression of the relevant repair gene. Examples of such genes are the excision repair 
cross complementing (ERCC) genes, twelve of which have been identified (see Target 
Gene Table). Defects in these genes are associated with Xeroderma Pigmentosum and 
5 Cockayne Syndrome. (Scriver, C. R. et al., The Metabolic and Molecular Bases of 

Inherited Disease, 7th edition, McGraw Hill, New York, 1995.) 

Alternatively, patients heterozygous for functional and defective copies of such genes, 
and in whom LOH resulted in loss of the functional allele, could be treated by a cross- 

10 link inducing procedure without the necessity for an allele specific inhibitor. 

Identification of such patients would require a test for heterozygosity at the target locus 
and a test for LOH which could show which allele is deleted in cancer cells. Such an 
approach would be expected to identify patients likely to respond well to cross linking 
agents or procedures even though they might have cancers not traditionally treated with 

15 such agents. 



Example 45. Asparagine Synthetase (AS). 
Variagenics Target Gene 

20 

Asparagine Synthase is conditionally essential 

Cells require a continuous supply of amino acids for protein biosynthesis. Cells can 
import amino acids from serum via amino acid transporters (the only source besides 
25 protein catabolism for the ten essential amino acids), or amino acids cells can be 

synthesized de novo by cells (only an option for the ten nonessential amino acids). The 
essential amino acids are isoleucine, leucine, lysine, methionine, phenylalanine, 
threonine, tryptophan, valine and histidine. Alterations in the nutritional environment 
of growing cells that result in a decreased extracellular concentration of essential amino 
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acids cause arrested cell growth and may result in cell death. 

Even a nonessential amino acid can become essential in a cell where (i) at least one 
enzyme required for its biosynthesis is not expressed (perhaps due to downregulation 
in response to an abundant extracellular supply of the amino acid), or (ii) the 
biosythetic pathway is blocked by an inhibitor. 

Asparagine is a nonessential amino acid which is, however, essential for survival of 
rapidly dividing cells that are not expressing asparagine synthetase, the terminal 
enzyme in asparagine biosynthesis. Asparagine synthetase, considered to be a 
housekeeping gene, catalyzes the ATP dependent conversion of aspartic acid to 
asparagine in mammalian cells. A number of different cancer types do not usually 
express asparagine synthetase, including childhood acute leukemias. One common 
therapeutic used in the treatment of childhood acute lymphocytic leukemia is the 
enzyme L-asparaginase (purified from E. coli or Erwinia carotovora) which, upon 
injection, rapidly depletes serum asparagine (by hydrolysis to aspartate), thereby 
lowering blood levels of asparagine to undetectable levels within hours of injection. 
(Ohnuma, T. et al. Biochemical and Pharmacological Studies with L-Asparaginase in 
Man. Cancer Research 30: 2297-2305, 1970.) Leukemic cells have high rates of 
protein synthesis but do not express asparagine synthetase and are therefore highly 
vulnerable to the rapid loss of asparagine and consequent shutdown of protein 
synthesis. Cell death after L-asparaginase induced asparagine starvation has been 
shown to be apoptotic. (Bussolati, O. Characterization of Apoptotic Phenomena 
Induced by Treatment with L-Asparaginase in NIH3T3 Cells. Experimental Cell 
Research 220: 283-291, 1995.) After one or more doses leukemic cells often become 
resistant to L-asparaginase due to induction of asparagine synthetase activity and 
consequent autonomy for asparagine. 

In a patient with two alternative alleles for asparagine synthetase and LOH at 7q, an 
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allele specific drug could be used to sensitize cancer cells to the action of L- 
asparaginase. Such drugs could then be used to treat cancer patients constitutionally 
heterozygous for two normal alleles at the asparagine synthetase locus, in whom LOH 
had rendered cancer cells hemizygous or homozygous for one allele. Treatment would 
consist in the administration of the appropriate allele specific inhibitor plus L- 
asparaginase to deplete the concentration of this amino acid in serum while rendering 
cancer cells (but not normal cells) unable to respond by upregulating asparagine 
synthetase. 

The Asparagine Synthetase gene maps to chromosome 7q2L3, a region frequently 
subject to LOH in different cancers 

The asparagine synthetase gene has been mapped to chromosome 7q21.3 by 
fluorescence in situ hybridization, following localization to 7q by analysis of somatic 
cell hybrids. The q21 region of chromosome 7 is subject to frequent LOH, particularly 
in colon, breast and prostate cancers. 7q21.3 LOH is detected in up to 50% of colon 
cancers, up to 37% of prostate cancers (83% of prostate cancers have LOH in the 
adjacent chromosome band, 7q31) and in 10-55% of breast cancers, where again, there 
is even more frequent LOH in 7q3 1 . LOH at 7q21 has also been reported in uterine 
cancer and head and neck cancer. Several other cancer types have not yet been well 
studied for LOH affecting this region. 

Example 46. Methionine Synthase (MS). 
Variagenics Target Gene 

Methionine Synthase is conditionally essential in dividing cells 

Cells require a continuous supply of amino acids for protein biosynthesis. L- 
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methionine is one of ten essential amino acids. Consequently dividing cells must 
obtain their methionine from serum via amino acid transporter (the only source besides 
protein catabolism for the ten essential amino acids). Alterations in the nutritional 
environment of growing cells that result in a decreased extracellular concentration of 
essential amino acids such as methionine cause arrested cell growth and may result in 
cell death. Cancer cells are particularly sensitive to methionine deprivation. (Tan, Y., 
et al., Anticancer Efficacy of Methioninase in vivo. Anticancer Research 16; 393 1- 
3936.) 



The cellular requirement for methionine can be bypassed: if L-homocysteine is 
provided to cells it can be methylated to form methionine by the enzyme methionine 
synthase (MS). In this reaction the methyl group is provided by 5- 
methyltetrahydrofolate and MS-bound methylcobalamin serves as an intermediate 
methyl carrier. A second enzyme may be required for reductive activation of 
methionine synthase, based on complementation studies. 

It occured to the inventors that the apparent antineoplastic effects of methionine 
deprivation could be enhanced and made tumor cell specific by preventing cells from 
converting endogenous homocysteine to methionine by allele specific inhibition of 
methionine synthase (or other enzymes required for the conversion of homocysteine 
to methionine; see: Scriver, C, et al., editors, The Metabolic and Molecular Basis of 
Inherited Disease McGraw Hill, New York, pp. 3 1 1 1 -3 1 28 and 3 1 29-3 1 49). This 
strategy would be useful in cancer patients that are heterozygous for methionine 
synthase (or another enzyme required for conversion of homocysteine to methionine) 
and who have LOH at the methionine synthase (or other) gene locus. In such patients 
an allele specific inhibitor of MS directed to the sole allele present in cancer cells, 
coupled with methionine starvation or methioninase treatment, would selectively 
prevent tumor cells from responding to methionine deprivation. The provision of 
supplemental homocysteine, which could only be converted to methionine by the 
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normal cells, would provide a way to amplify the differential toxicity to cancer cells. 
Also, the methionine analog ethionine has been shown to potentiate the effects of 
methionine starvation. (Poirson-Bichat, F., et ah, Growth of methionine-dependent 
human prostate cancer (PC-3) is inhibited by ethionine combined with methionine 
5 starvation. Br. J. Cancer 75: 1605-1612.) Ethionine or similar agents could be used 

in conjunction with an allele specific inhibitor of methionine synthesis. 

An alternative approach to allele specific therapy of cancer cells with LOH would be 
to target the amino acid transport system for methionine in patients heterozygous for 
10 this protein and in whom only one allele is present in cancer tissue as a result of LOH. 

This would result in selective methionine starvation for cancer cells. Allele specific 
transport inhibition could be combined with methionine starvation or methioninase 
treatment to enhance the cytotoxic effect. 

15 The Methionine Synthase gene maps to chromosome lq43, a region subject to LOH in 

several cancers 

The MS gene has been mapped to chromosome lq43 by fluoresence in situ 
hybridization. The q43 region of chromosome 1 is subject to frequent LOH 
20 particularly in colon, head and neck, ovarian and liver cancers, where LOH frequencies 

vary from 11 to 39%. LOH at lq43 has also been reported in cervix, pancreas, 
stomach and testis cancers. Several other cancer types have not yet been well studied 
for LOH in this region. 

25 Other amino acid biosynthetic enzymes are candidates for allele specific inhibition 

It will be evident to one skilled in the art that strategies similar to those described 
above for asparagine (an essential amino acid) and methionine (a non-essential amino 
acid) could be undertaken for other amino acid biosynthetic enzymes. For example, 
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L-glutaminase has also been shown to have antiproliferative effects on mammalian cell 
growth. Allele specific blockade of glutamine synthesis in heterozygous patients with 
LOH for genes essential for glutamine synthesis could be the basis of a cancer specific 
therapy. 



Example 47. Methylthioadenosine phosphorylase (MTAP). 
Variagenics Target Gene 

10 

Methylthioadenosine phosphorylase can convert methylthioadenosine to methionine, 
an essential amino acid 

Cells require a continuous supply of amino acids for protein biosynthesis. L- 
15 methionine is one of ten essential amino acids. Consequently dividing cells must 

obtain methionine from serum via amino acid transporter (the only source besides 
protein catabolism or conversion of L-homocysteine). Alterations in the nutritional 
environment of growing cells that result in a decreased extracellular concentration of 
essential amino acids such as methionine cause arrested cell growth and may result in 
20 cell death. Cancer cells are particularly sensitive to methionine deprivation. (Tan, Y., 

et al., Anticancer Efficacy of Methioninase in vivo. Anticancer Research 16: 3931- 
3936.) 



The cellular requirement for methionine can be bypassed by conversion of L- 
homocysteine to methionine as discussed above. An alternative pathway for 
methionine synthesis is conversion of 5' -methylthioadenosine (5'-MTA) via the action 
of 5'-MTA phosphorylase (MTAP). (Tisdale, M.J., Methionine Synthesis from 5'- 
methylthioadenosine by Tumor Cells. Biochemical Pharmacology 32: 2915-2920.) 
In tissue culture experiments low concentrations of 5'-MTA can substitute for 
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methionine in some cell lines. Thus 5'-MTA can rescue cells from methionine 
deprivation. 

It occured to the inventors that allele specific inhibition of MTAP in cancer patients 
heterozygous for MTAP and whose cancer cells have only one allele of MTAP as a 
consequence of LOH, in combination with methionine deprivation (methionine 
starvation or L-methioninase treatment) and dietary supplementation with 5'- 
methylthioadenosine would provide a source of convertible methionine substrate 
selectively useful to normal cells. Tumor cells would have no source of methionine, 
being unable to convert the 5'-methylthioadenosine, and hence would be selectively 
poisoned. This therapeutic strategy would not necessarily require an allele specific 
inhibitor as all copies of MTAP are deleted in some cancers. Such cancers should be 
differentially poisoned vis a vis normal cells by methionine deprivation in the presence 
of S'-methylthioadenosine. 

The MTAP gene maps to 9p21, a region frequently subject to LOH in many cancers 

The MTAP gene has been mapped to chromosome 9p21 by physical techniques 
(pulsed field gel electrophoresis and yeast artificial chromosome mapping). The gene 
lies near the cyclin dependent kinase inhibitors p!6 and pi 5 which are frequently 
reduced to one or zero copies in cancer cells. (Nobori, et al., Genomic cloning of 
methylthioadenosine phosphorylase: a purine metabolic enzyme deficient in multiple 
different cancers. Proc. Natl. Acad. Sci. U.S.A. 93: 6203-6208.) The p21 region of 
chromosome 9 is subject to frequent LOH particularly in cancers of the bladder, breast, 
esophagus, head and neck, kidney, lung, melanoma and ovary. The frequency of LOH 
in these cancers ranges from 20% to nearly 100%. 
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Example 48. DNA dependent protein kinase (DNA-PK) and associated factors. 
Variagenics Target Genes 

DNA dependent protein kinase is conditionally essential 

Cells exposed to ionizing radiation, such as gamma radiation, are damaged by base 
modifications and DNA strand breaks. Double strand DNA breaks are among the most 
lethal form of radiation damage; one such break, if unrepaired, can be cell lethal. Four 
complementation groups of mammalian cell mutants that are defective in repair of 
double strand (ds) breaks have been identified. All four complementation groups are 
hypersensitive to ionizing radiation. The loci for three of these groups have been 
shown to encode components of DNA-dependent protein kinase (DNA-PK). The 
fourth group is deficient in the gene encoding XRCC4, a factor that associates with and 
stimulates DNA Ligase IV. Ligation of ds breaks by DNA ligase IV in a cell free 
system in increased 7-8 fold by co-expression of XRCC4. 

DNA-PK is a multiprotein complex with a DNA binding regulatory subunit, the Ku 
heterodimer [Ku70 (XRCC6) and Ku80, also referred to as Ku86 (XRCC5)], and a 
catalytic subunit, DNA-PKcs (probably XRCC7), that is activated by the regulatory 
subunit upon binding to DNA ds ends, with consequent expression of serine/threonine 
kinase activity resulting in phosphorylation of a variety of DNA binding proteins. A 
fourth protein called KARP-1 is expressed from the Ku80/86 locus and is also 
implicated in DNA-PK function. 

Cells lacking any of the components of DNA-PK are exquisitely sensitive to gamma 
irradation. This has been demonstrated directly in mice with targeted disruption of the 
Ku80/86 and DNA-PKcs genes. The Ku80/86 deficient mice were also sensitive to 
methyl methane sulfonate, a DNA alkylating agent that induces single strand breaks 
and to etoposide, a topoisomerase II inhibitor. Thus the components of DNA-PK can 
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also be important for repair of a variety of chemically induced DNA lesions as well as 
ionizing radiation. 

In a cancer patient with two alternative alleles for a component of DNA-PK and LOH 
5 at the heterozygous locus, an allele specific inhibitory drug could be used to sensitize 

cancer cells to the action of ds break inducing treatments. Such a drug could be used 
to treat cancer patients constitutionally heterozygous for two normal alleles at any of 
the DNA-PK loci in whom LOH had rendered cancer cells hemizygous or homozygous 
for one allele. Treatment would consist in the administration of the appropriate allele 

10 specific inhibitor plus a ds break inducing agent or procedure. The tumor cells would 

be unable to effectively repair ds breaks, while the uninhibited allele in normal cells 
would be able to function. Alternatively, patients heterozygous for functional and 
defective copies of genes required for repair of strand breaks, and in whom LOH 
resulted in loss of the functional allele, could be treated by a strand break inducing 

15 procedure without the necessity for an allele specific inhibitor. Identification of such 

patients would require a test for heterozygosity at the target locus and a test for LOH 
which could show which allele is deleted in cancer cells. Such an approach would be 
expected to identify patients likely to respond well to strand breaking agents or 
procedures (exposure to ionizing radiation) even though they might have cancers not 

20 traditionally treated with such measures. 

The genes encoding constituents of DNA-PK map to chromosomes frequently subject 
to LOH in different cancers 

25 The DNA-PKcs gene has been mapped to 8ql 1, the Ku80/86 gene to 2ql l-ql3 and the 

Ku70 gene to 22ql 1 -q 1 3 . All three regions are subject to LOH in different cancers. 
LOH on 2q has been reported in lung ovary and cervical cancers at frequencies ranging 
from 1 1% to 39%. LOH for 8q has been reported in cervix, head and neck, kidney, 
lung, ovary, prostate and testis cancers at frequencies ranging from 20% to 50% of 
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cancers. LOH on 22q has been reported in brain, breast colon, head and neck, lung, 
ovary, pediatric and stomach cancers at frequencies ranging from 10 to 76%. Several 
other cancer types have not yet been well studied for LOH affecting either region. 

5 Other proteins required for repair of DNA strand breaks are also candidates for allele 

specific therapy of cancer 

It will be evident to one skilled in the art that strategies similar to those described 
above for DNA-PK could be undertaken for other proteins required for repair of DNA 

10 strand breaks. For a recent review of such proteins see: Zdzienicka, M.Z., Mammalian 

mutants defective in the response to ionizing radiation-induced DNA damage. 
Mutation Research 336: 203-213, 1995; Thompson, L.H. and P.A. Jeggo, 
Nomenclature of human genes involved in ionizing radiation sensitivity. Mutation 
Research 337: 131-134, 1995; Thacker, J. and R.E. Wilkinson, The gentic basis of 

15 cellular recovery from radiation damage: response of the radiosensitive irs lines to low- 

dose rate irradiation. Radiation Research 144: 294-300, 1995. Two other syndromes 
with hypersensitivity to X-rays are Diamond-Blackfan anemia and aplastic anemia 
(Diemen, P.C., X-ray-sensitivity of lymphocytes of aplastic- and Diamond-Blackfan- 
anemia patients as detected by conventional cytogentic and chromosome painting 

20 techniques. Mutation Resarch 373: 225-235, 1997). Recently evidence of several 

other genes responsible for DNA double strand break repair has been described. 
(Nicolas, N., Finnie, N.J., et al., Eur. J. Immunol. 26:1 1 18-1 122, 1996.) The above 
genes which, when defective, sensitize cells to toxic effects of DNA strand breaking 
agents would be amenable to the therapeutic strategy outlined above for the DNA-PK 

25 genes. Specifically, in a patient with two alternative alleles for such a gene and LOH 

at the relevant locus, an allele specific drug could be used to sensitize cancer cells to 
the action of strand breaking agents. Such drugs could then be used to treat cancer 
patients constitutionally heterozygous for two normal alleles at the relevant locus, in 
whom LOH had rendered cancer cells hemizygous or homozygous for one allele. 
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Treatment would consist in the administration of the appropriate allele specific 
inhibitor plus a strand breaking agent or treatment to induce damage in all cells. 
Cancer cells (but not normal cells) would be rendered unable to respond by inhibition 
of expression of the relevant repair gene. 

5 

Alternatively, patients heterozygous for functional and defective copies of genes 
required for repair of strand breaks, and in whom LOH resulted in loss of the 
functional allele, could be treated by a strand break inducing procedure without the 
necessity for an allele specific inhibitor. Identification of such patients would require 
10 a test for heterozygosity at the target locus and a test for LOH which could show which 

allele is deleted in cancer cells. Such an approach would be expected to identify 
patients likely to respond well to strand breaking agents or procedures (exposure to 
ionizing radiation) even though they might have cancers not traditionally treated with 
such measures. 



Example 49. Ataxia Telangiectasia Mutated (ATM) and c-Abl 
Variagenics Target Gene 

20 

The Ataxia Telangiectasia gene is essential for cell growth or survival in the presence 
of ionizing radiation or DNA damaging molecules 

In order for cells to survive or proliferate in the presence of ionizing radiation (IR) or 
25 radiomimetic chemicals it is necessary that they are capable of efficiently repairing IR 

induced damage. Cells contain proteins necessary for such repair. One way such 
proteins can be identified is by their absence in specific patients who are particularly 
susceptible to the toxic effects of IR. 
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Ataxia Telangiectasia (AT) is a genetically transmitted autosomal recessive disorder 
characterized by variable degrees of immunodeficiency, telagiectasia (small blood 
vessels growing near the surface of the skin or eye), cerebellar ataxia (loss of balance 
due to abnormal development of the cerebellum) and increased sensitivity to both 

5 ionizing radiation and radiomimetic drugs, including bleomycin; AT cells are killed 

by lower doses of ionizing radiation or radiomimetic drugs than normal cells. Further, 
heterozygotes for mutant and normal AT alleles have radiation sensitivity close to that 
of homozygous normals. Therefore cancer cells from individuals heterozygous for null 
alleles of the AT gene (called ATM) should be highly susceptible to radiation therapy 

0 when only the deficient AT allele remains in cancer cells due to LOH, compared to 

normal cells from the same patients. Such patients could be treated by a DNA damage 
inducing procedure without the necessity for an allele specific inhibitor. Identification 
of such patients would require a test for heterozygosity at the target locus and a test for 
LOH which could show which allele is deleted in cancer cells. Such an approach 

5 would be expected to identify patients likely to respond well to strand breaking agents 

or procedures (such as exposure to ionizing radiation) even though they might have 
cancers not traditionally treated with such measures. In a related aspect, this approach 
is applicable to heterozygotes for other genes associated with ATM-mediated 
radiosensitivity. One such protein is the c-Abl protein tyrosine kinase, which binds to 

^0 the ATM protein and regulates its function. c-Abl is known to be important in the 

stress response to ionizing radiation. One of its functions is activation of stress 
activated protein kinases (SAPKs) after irradiation or exposure to alkylating agents 
such as c/s-platinum or mitomycin C, a response that is defective in ATM cells. 
Correction of the SAPK activation defect in ATM cells by non-mutant ATM cDNA 

5 suggests that the ATM - c-Abl interaction is necesary for the DNA damage response. 

(Kharbanda, S., et al. Nature 376: 785-788, 1995.) 

In a cancer patient with two alternative functional alleles for a component of ATM and 
LOH at the ATM locus, an allele specific inhibitory drug could be used to sensitize 
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cancer cells to the action of DNA damage inducing treatments such as ionizing 
radiation or radiomimetic drugs. Such an allele specific drug could be used to treat 
cancer patients constitutionally heterozygous for two normal ATM alleles in whom 
LOH had rendered cancer cells hemizygous or homozygous for one allele. Treatment 
would consist of the administration of the appropriate allele specific inhibitor plus a 
DNA damage inducing treatment or procedure. The tumor cells would be unable to 
effectively the DNA damage, while the uninhibited allele in normal cells would be able 
to function. A similar approach could be taken to 

The ATM gene is polymorphic 

The ATM cDNA is 9.58 kb. Several likely polymorphisms have been identified, 
although population studies have not yet been performed to determine allele 
frequencies. One of the reported polymorphisms, an ATG to ATA change in codon 
847, results in a methionine vs. isoleucine difference. Thus ATM is potentially 
targetable at the DNA, RNA and protein levels. It is likely that additional variances 
will be identified with broader population surveys and computational variance 
detection. 

The ATM gene maps to chromosome llq23 and the c-Abl gene maps to 9q34.1 f two 
regions of high frequency LOH in different cancer types 

Chromosome 9q34 is lost in a high fraction of bladder, esophagus, ovary, head & neck 
and testis cancers (17 - 76%) and in a lesser fraction of breast, liver and prostate 
cancers and leukemias. Chromosome 1 lq23 is lost in brain, cervix, esophagus, breast, 
kidney, colon, stomach, head & neck and lung cancers at frequencies ranging from 
16% to 100%. 

Other proteins required for repair of DNA damage are also candidates for allele 
specific therapy of cancer 
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It will be evident to one skilled in the art that strategies similar to those described 
above for ATM and c-Abl could be undertaken for other proteins required for the stress 
response to DNA damaging agents, such as other stress activated protein kinases or 
downstream effector proteins. 
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Methylguanine Methyltransferase (MGMT) 
Gene VARIA 1534 

The methylguanine methyltransferase gene is essential for cell growth or survival in 
5 the presence of alkylating agents 

Methylguanine methyltransferase (MGMT) is a suicide protein that repairs alkylating 
agent damage, specifically alkylation of the 6 0 position of guanine. Alkyl groups are 
covalently bound to an active site cysteine (residue 145) of MGMT, thereby 
10 irreversibly inactivating the protein. 6 0-benzylguanine is an analog inhibitor of 

MGMT that, by inactivating MGMT, renders tumor cells more sensitive to the toxic 
effects of methylating and chloroethylating agents. MGMT is thus a conditionally 
essential gene in the presence of such drugs. 6 0-benzylguanine is being developed as 
a chemosensitizing agent. 

15 

In a cancer patient with two alternative functional MGMT alleles an allele specific 
inhibitory drug could be used to sensitize cancer cells to the action of alkylating agents. 
Such an allele specific drug could be used to treat cancer patients constitutionally 
heterozygous for two normal MGMT alleles in whom LOH had rendered cancer cells 
20 hemizygous or homozygous for one allele. Treatment would consist of the 

administration of the appropriate allele specific inhibitor plus an alkylating agent. The 
tumor cells would be unable to effectively repair the alkylating agent induced DNA 
damage, while the uninhibited allele in normal cells would be able to function. 

25 The MGMT gene is polymorphic 

Several variances have been reported in human MGMT, or discovered by Variagenics, 
including three protein polymorphisms. There is a silent C/T variance at position 255 
(1 1% heterozygotes among 36 individuals surveyed), another C/T variance at nt. 346 
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which results in a L84F amino acid variance (5% heterozygoses), an A/G variance at 
nt. 523 which results in a 1 143 V amino acid variance (24% heterozygotes). A variance 
has been reported in Japanese at codon 160, GGA vs. AGA, converting glycine to 
arginine. 15% of the population studied were heterozygotes. 

5 

The alteration of glycine 160 to arginine reduced the inactivation by 06-benzylguanine 
with an approximately 20 fold increase in the IC50 concentration. These results 
demonstrate variance-specific effects of a small molecule, 06-benzylguanine, on 
normal (non-mutant) alleles of the conditionally essential MGMT gene. 

10 

Administration of 06 benzylguanine to patients who are heterozygous for the residue 
160 gly/arg variance in their normal cells, and contain only the form of the gene with 
a glycine residue at position 160 in their cancer cells, together with methylating or 
chloroethylating agents for chemotherapy, will be specifically toxic to cancer cells 
15 without increasing toxicity to normal cells. 
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All patents and publications mentioned in the specification are indicative of the levels 
of skill of those skilled in the art to which the invention pertains. All references 
cited in this disclosure are incorporated by reference to the same extent as if each 
reference had been incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well 



WO 98/41648 



PCT/US98/05419 



332 232/116 
adapted to carry out the objects and obtain the ends and advantages mentioned, as 
well as those inherent therein. The groups of genes and the particular genes 
described herein as presently representative of preferred embodiments are exemplary 
and are not intended as limitations on the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art, which are encompassed within 
the spirit of the invention, are defined by the scope of the claims. 

It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from 
the scope and spirit of the invention. For example, those skilled in the art will 
readily recognize that the methods and inhibitors can utilize a variety of different 
target genes within the groups described. Thus, such additional embodiments are 
within the scope of the present invention and the following claims. 

The invention illustratively described herein suitably may be practiced in the absence 
of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially or and "consisting or may be replaced with 
either of the other two terms. The terms and expressions which have been employed 
are used as terms of description and not of limitation, and there is no intention that 
in the use of such terms and expressions of excluding any equivalents of the features 
shown and described or portions thereof, but it is recognized that various 
modifications are possible within the scope of the invention claimed. Thus, it should 
be understood that although the present invention has been specifically disclosed by 
preferred embodiments and optional features, modification and variation of the 
concepts herein disclosed may be resorted to by those skilled in the art, and that such 
modifications and variations are considered to be within the scope of this invention 
as defined by the appended claims. 
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In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will 
recognize that the invention is also thereby described in terms of any individual 
member or subgroup of members of the Markush group or other group. 

Thus, additional embodiments are within the scope of the invention and within the 
following claims. 
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CLAIMS 



What we claim is: 



1 . A method for identifying an inhibitor potentially usefiil for treatment of cancer, 
wherein said inhibitor is active on a gene vital for cell growth or viability, and wherein 
said gene is subject to loss of heterozygosity in a cancer, said method comprising the 
steps of: 

(a) determining at least two alleles of a said gene, wherein said gene 
encodes a product required for cell proliferation; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
or reduction of the level of activity of a product of at least one but less than all of said 
alleles in the presence of said potential allele specific inhibitor is indicative that said 
potential allele specific inhibitor is a said inhibitor. 

2. A method for identifying an inhibitor potentially useful for treatment of 
cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 
and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene 
encodes a product required to maintain inorganic ions and vitamins at levels 
compatible with cell growth or survival; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
or reduction of the level of activity of a product of at least one but less than all of 
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said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

3. A method for identifying an inhibitor potentially usefiil for treatment of 
5 cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 

and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene 
encodes a product required to maintain organic compounds at levels compatible with 

10 cell growth or survival; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
15 or reduction of the level of activity of a product of at least one but less than all of 

said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

4. A method for identifying an inhibitor potentially useful for treatment of 
20 cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 

and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene 
encodes a product required to maintain cellular proteins at levels compatible with cell 

25 growth or survival; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
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or reduction of the level of activity of a product of at least one but less than all of 
said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

5 5. A method for identifying an inhibitor potentially useful for treatment of 

cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 
and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene 
10 encodes a product required to maintain cellular nucleotides at levels compatible with 

cell growth or survival; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 

15 wherein inhibition of expression of at least one but less than all of said alleles 

or reduction of the level of activity of a product of at least one but less than all of 
said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

20 6. A method for identifying an inhibitor potentially useful for treatment of 

cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 
and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene 
25 encodes a product required to maintain the integrity and function of cellular and 

subcellular structures; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 
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wherein inhibition of expression of at least one but less than all of said alleles 
or reduction of the level of activity of a product of at least one but less than all of 
said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

7. A method for identifying an inhibitor potentially useful for treatment of 
cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 
and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene is 
located on a high frequency LOH chromosomal region; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
or reduction of the level of activity of a product of at least one but less than all of 
said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

8. The method of claim 7, wherein said gene is located on a chromosomal arm 
which has a frequency of allele loss of at least 15% in a cancer. 

9. The method of claim 7, wherein said gene is located in proximity to a 
chromosomal marker which undergoes LOH at a frequency of at least 10% in a 
cancer. 

10. The method of claim 7, wherein said gene is located in proximity to a tumor 
suppressor gene which undergoes LOH at a frequency of at least 10% in a cancer. 
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11. A method for identifying an inhibitor potentially useful for treatment of 
cancer, wherein said inhibitor is active on a gene vital for cell growth or viability, 
and wherein said gene is subject to loss of heterozygosity in a cancer, said method 
comprising the steps of: 

(a) determining at least two alleles of a said gene, wherein said gene has 
at least two sequence variances which occur at frequences such that at least 10% of 
a population is heterozygous for said gene; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said 
alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
or reduction of the level of activity of a product of at least one but less than all of 
said alleles in the presence of said potential allele specific inhibitor is indicative that 
said potential allele specific inhibitor is a said inhibitor. 

12. The method of claim 11, wherein said gene is located on a high frequency 
LOH chromosomal region. 

13. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a gene vital for cell viability or cell growth, wherein 
said gene encodes a product required for cell proliferation, said gene has at least two 
alternative alleles in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 

14. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a gene vital for cell viability or cell growth,wherein 
said gene encodes a product required to maintain inorganic ions and vitamins at 
levels compatible with cell growth or survival, said gene has at least two alternative 
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alleles in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 



15. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a gene vital for cell viability or cell growth, wherein 
said gene encodes a product required to maintain organic compounds at levels 
10 compatible with cell growth or survival, said gene has at least two alternative alleles 

in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 

15 16. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 

is active on an allelic form of a gene vital for cell viability or cell growth, wherein 
said gene encodes a product required to maintain cellular proteins at levels 
compatible with cell growth or survival, said gene has at least two alternative alleles 
in a population, and 

20 wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 

17. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a gene vital for cell viability or cell growth, wherein 
25 said gene encodes a product required to maintain cellular nucleotides at levels 

compatible with cell growth or survival, said gene has at least two alternative alleles 
in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 
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18. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a gene vital for cell viability or cell growth, wherein 
said gene encodes a product required to maintain the integrity and function of 
cellular and subcellular structures, said gene has at least two alternative alleles in a 
5 population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 



19. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
10 is active on an allelic form of a gene vital for cell viability or cell growth, wherein 

said gene is located on a high frequency LOH chromosomal arm region, said gene 
has at least two alternative alleles in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 

15 

20. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a gene vital for cell viability or cell growth, wherein 
said gene has at least two sequence variances which occur at frequences such that at 
least 10% of a population is heterozygous for said gene, said gene has at least two 

20 alternative alleles in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 



21. A pharmaceutical composition, comprising 
25 at least one allele specific inhibitor targeting at least one but less than all 

allelic forms of an essential gene in a population, wherein said gene encodes a 
product required for cell proliferation; and 

a pharmaceutical^ acceptable carrier or excipient. 
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22. A pharmaceutical composition, comprising 

at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of an essential gene in a population, wherein said gene encodes a 
product required to maintain inorganic ions and vitamins at levels compatible with 
5 cell growth or survival; and 

a pharmaceutically acceptable carrier or excipient. 



10 23. A pharmaceutical composition, comprising 

at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of an essential gene in a population, wherein said gene encodes a 
product required to maintain organic compounds at levels compatible with cell 
growth or survival; and 

15 a pharmaceutically acceptable carrier or excipient. 

24. A pharmaceutical composition, comprising 

at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of an essential gene in a population, wherein said gene encodes a 
20 product required to maintain cellular proteins at levels compatible with cell growth 

or survival; and 

a pharmaceutically acceptable carrier or excipient, 

25. A pharmaceutical composition, comprising 

25 at least one allele specific inhibitor targeting at least one but less than all 

allelic forms of an essential gene in a population, wherein said gene encodes a 
product required to maintain cellular nucleotides at levels compatible with cell 
growth or survival; and 

a pharmaceutically acceptable carrier or excipient. 
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26. A pharmaceutical composition, comprising 

at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of an essential gene in a population, wherein said gene encodes a 
product required to maintain the integrity and function of cellular and subcellular 
structures; and 

a pharmaceutically acceptable carrier or excipient. 



27. A pharmaceutical composition, comprising 

at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of an essential gene in a population, wherein said gene is located on a 
high frequency LOH chromosomal arm region; and 

a pharmaceutically acceptable carrier or excipient. 

28. A pharmaceutical composition, comprising 

at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of an essential gene in a population, wherein said gene has at least two 
sequence variances which occur at frequences such that at least 10% of a population 
is heterozygous for said gene; and 

a pharmaceutically acceptable carrier or excipient. 

29. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

(a) identifying a gene vital to cell viability or cell growth that has alternative 
allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene encodes a product required for cell 
proliferation; 
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(b) screening to identify an inhibitor which inhibits said at least one but less 
than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 
cells are heterozygous for said gene. 

30. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

(a) identifying a gene vital to cell viability or cell growth that has alternative 
allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene encodes a product required to 
maintain inorganic ions and vitamins at levels compatible with cell growth or 
survival; 

(b) screening to identify an inhibitor which inhibits said at least one but less 
than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 
cells are heterozygous for said gene. 

31. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

(a) identifying a gene vital to cell viability or cell growth that has alternative 
allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene encodes a product required to 
maintain organic compounds at levels compatible with cell growth or survival; 
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(b) screening to identify an inhibitor which inhibits said at least one but less 
than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 

5 cells have only the allele of said gene inhibited by said inhibitor and in whom normal 

cells are heterozygous for said gene. 

32. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 

10 a gene having at least two alternative alleles, comprising the steps of: 

(a) identifying a gene vital to cell viability or cell growth that has alternative 
allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene encodes a product required to 
maintain cellular proteins at levels compatible with cell growth or survival; 

15 (b) screening to identify an inhibitor which inhibits said at least one but less 

than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 

20 cells are heterozygous for said gene. 

33. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

25 (a) identifying a gene vital to cell viability or cell growth that has alternative 

allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene encodes a product required to 
maintain cellular nucleotides at levels compatible with cell growth or survival; 

(b) screening to identify an inhibitor which inhibits said at least one but less 
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than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 
5 cells are heterozygous for said gene. 

34. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

10 (*) identifying a gene vital to cell viability or cell growth that has alternative 

allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene encodes a product required to 
maintain the integrity and function of cellular and subcellular structures; 

(b) screening to identify an inhibitor which inhibits said at least one but less 
15 than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 
cells are heterozygous for said gene. 

20 

35 . A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

(a) identifying a gene vital to cell viability or cell growth that has alternative 
25 allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 

deleted in a cancer cell, and wherein said gene is located on a high frequency LOH 
chromosomal arm region; 

(b) screening to identify an inhibitor which inhibits said at least one but less 
than all of said at least two alternative alleles; and 
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(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 
cells are heterozygous for said gene. 

36. A method for producing an inhibitor potentially useful for cancer treatment, 
wherein said inhibitor is active on at least one but less than all alternative alleles of 
a gene having at least two alternative alleles, comprising the steps of: 

(a) identifying a gene vital to cell viability or cell growth that has alternative 
allelic forms in a noncancerous cell, wherein one of said alternative allelic forms is 
deleted in a cancer cell, and wherein said gene has at least two sequence variances 
which occur at frequences such that at least 10% of a population is heterozygous for 
said gene; 

(b) screening to identify an inhibitor which inhibits said at least one but less 
than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
effect when administered to a patient suffering from a cancer in which cancerous 
cells have only the allele of said gene inhibited by said inhibitor and in whom normal 
cells are heterozygous for said gene. 

37. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 
precancerous condition, wherein the normal somatic cells of said patient are 
heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene encodes a product required 
for cell proliferation; and 



WO 98/41648 



PCT/US98/05419 



347 232/116 

wherein cells of said precancerous condition have undergone LOH of said 
first gene. 

38. The method of claim 37, wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

b. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
each targeted essential gene and each targeted essential gene has undergone LOH in 
cells of said precancerous condition. 



39. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 

20 precancerous condition, wherein the normal somatic cells of said patient are 

heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene encodes a product required 
to maintain inorganic ions and vitamins at levels compatible with cell growth or 

25 survival; and 

wherein cells of said precancerous condition have undergone LOH of said 
first gene. 

40. The method of claim 39, wherein the cells of said precancerous condition are 
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not clonal from a single cell, further comprising the step of: 

b. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
each targeted essential gene and each targeted essential gene has undergone LOH in 
cells of said precancerous condition. 

41. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 
precancerous condition, wherein the normal somatic cells of said patient are 
heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene encodes a product required 
to maintain organic compounds at levels compatible with cell growth or survival; and 

wherein cells of said precancerous condition have undergone LOH of said 
first gene. 

42. The method of claim 41 , wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

b. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
each targeted essential gene and each targeted essential gene has undergone LOH in 
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cells of said precancerous condition. 

43. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 
precancerous condition, wherein the normal somatic cells of said patient are 
heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene encodes a product required 
to maintain cellular proteins at levels compatible with cell growth or survival; and 

wherein cells of said precancerous condition have undergone LOH of said 
first gene. 

44. The method of claim 43, wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

b. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
each targeted essential gene and each targeted essential gene has undergone LOH in 
cells of said precancerous condition. 

45. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 
precancerous condition, wherein the normal somatic cells of said patient are 
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heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene encodes a product required 
to maintain cellular nucleotides at levels compatible with cell growth or survival; and 
5 wherein cells of said precancerous condition have undergone LOH of said 

first gene. 

46. The method of claim 45, wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

10 b. serially administering to said patient at least one additional allele specific 

inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 

15 each targeted essential gene and each targeted essential gene has undergone LOH in 

cells of said precancerous condition. 



47. A method for preventing the development of cancer in a patient having a 

20 precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 
precancerous condition, wherein the normal somatic cells of said patient are 
heterozygous for said first gene, said inhibitor is active on at least one but less than 

25 all allelic forms of said gene present in a population and targets only one allelic form 

present in said normal somatic cells, and said first gene encodes a product required 
to maintain the integrity and function of cellular and subcellular structures; and 

wherein cells of said precancerous condition have undergone LOH of said 
first gene. 
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48. The method of claim 47, wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

b. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
5 a different allele of an essential gene than is targeted by said first allele specific 

inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
each targeted essential gene and each targeted essential gene has undergone LOH in 
cells of said precancerous condition. 

10 

49. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 

15 precancerous condition, wherein the normal somatic cells of said patient are 

heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene is located on a high 
frequency LOH chromosomal arm region; and 

20 wherein cells of said precancerous condition have undergone LOH of said 

first gene. 

50. The method of claim 49, wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

25 b. serially administering to said patient at least one additional allele specific 

inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
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each targeted essential gene and each targeted essential gene has undergone LOH in 
cells of said precancerous condition. 

51. A method for preventing the development of cancer in a patient having a 
precancerous condition, comprising the steps of: 

a. administering to said patient a therapeutic amount of a first allele specific 
inhibitor targeted to an allele of a first essential gene present in cells of said 
precancerous condition, wherein the normal somatic cells of said patient are 
heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells, and said first gene has at least two sequence 
variances which occur at frequences such that at least 10% of a population is 
heterozygous for said gene; and 

wherein cells of said precancerous condition have undergone LOH of said 
first gene. 

52 . The method of claim 5 1 , wherein the cells of said precancerous condition are 
not clonal from a single cell, further comprising the step of: 

b. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of an essential gene than is targeted by said first allele specific 
inhibitor, wherein said different allele may be a different allele of said first gene or 
an allele of a different essential gene, and wherein said patient is heterozygous for 
each targeted essential gene and each targeted essential gene has undergone LOH in 
cells of said precancerous condition. 

53. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
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least one but less than all allelic forms of said gene present in a population, 

wherein said gene encodes a product required for cell proliferation, said allele 
specific inhibitor inhibits only one allelic form of said gene present in said patient, 
and said only one allelic form of said gene is present in cancer cells in said patient. 

5 

54. The method of claim 53, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
10 allele of said particular gene; or 

(c) both (a) and (b). 

55. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 

wherein said gene encodes a product required to maintain inorganic ions and 
vitamins at levels compatible with cell growth or survival, said allele specific 
inhibitor inhibits only one allelic form of said gene present in said patient, and said 
only one allelic form of said gene is present in cancer cells in said patient. 

56. The method of claim 55, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 

57. A method for treating a patient suffering from a cancer, wherein said patient 
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is heterozygous for a gene vital for cell growth or viability, comprising the step of: 
administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 

wherein said gene encodes a product required to maintain organic compounds 
5 at levels compatible with cell growth or survival, said allele specific inhibitor inhibits 

only one allelic form of said gene present in said patient, and said only one allelic 
form of said gene is present in cancer cells in said patient. 

58. The method of claim 57, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 

59. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 

wherein said gene encodes a product required to maintain cellular proteins 
at levels compatible with cell growth or survival, said allele specific inhibitor inhibits 
only one allelic form of said gene present in said patient, and said only one allelic 
form of said gene is present in cancer cells in said patient. 

25 60. The method of claim 59, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 



10 



15 



20 



WO 98/41648 



PCT/US98/05419 



355 232/116 

(c) both (a) and (b). 

61. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 

wherein said gene encodes a product required to maintain cellular nucleotides 
at levels compatible with cell growth or survival, said allele specific inhibitor inhibits 
only one allelic form of said gene present in said patient, and said only one allelic 
form of said gene is present in cancer cells in said patient. 

62. The method of claim 61, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 

63. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 

wherein said gene encodes a product required to maintain the integrity and 
function of cellular and subcellular structures, said allele specific inhibitor inhibits 
only one allelic form of said gene present in said patient, and said only one allelic 
form of said gene is present in cancer cells in said patient. 

64. The method of claim 63, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
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heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 

5 

65. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 
10 wherein said gene is located on a high frequency LOH chromosomal arm 

region, said allele specific inhibitor inhibits only one allelic form of said gene 
present in said patient, and said only one allelic form of said gene is present in 
cancer cells in said patient. 

66. The method of claim 65, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 

67. A method for treating a patient suffering from a cancer, wherein said patient 
is heterozygous for a gene vital for cell growth or viability, comprising the step of: 

administering a therapeutic amount of an allele specific inhibitor active on at 
25 least one but less than all allelic forms of said gene present in a population, 

wherein said gene has at least two sequence variances which occur at 
frequences such that at least 10% of a population is heterozygous for said gene, said 
allele specific inhibitor inhibits only one allelic form of said gene present in said 
patient, and said only one allelic form of said gene is present in cancer cells in said 
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patient. 

68. The method of claim 67, further comprising the steps of: 

(a) determining whether non-cancerous cells of said patient are 
heterozygous for a particular gene essential for cell growth or viability; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 

69. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

viability or growth, 

wherein said gene encodes a product required for cell proliferation, and 
wherein said inhibitor is less active on at least one other allele of said gene. 

70. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

viability or growth, 

wherein said gene encodes a product required to maintain inorganic ions and 
vitamins at levels compatible with cell growth or survival, and wherein said inhibitor 
is less active on at least one other allele of said gene. 

71. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

viability or growth, 

wherein said gene encodes a product required to maintain organic compounds 
at levels compatible with cell growth or survival, and wherein said inhibitor is less 
active on at least one other allele of said gene. 
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72. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

viability or growth, 

wherein said gene encodes a product required to maintain cellular proteins 
5 at levels compatible with cell growth or survival, and wherein said inhibitor is less 

active on at least one other allele of said gene. 

73. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

10 viability or growth, 

wherein said gene encodes a product required to maintain cellular nucleotides 
at levels compatible with cell growth or survival, and wherein said inhibitor is less 
active on at least one other allele of said gene. 

74. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

viability or growth, 

wherein said gene encodes a product required to maintain the integrity and 
function of cellular and subcellular structures, and wherein said inhibitor is less 
active on at least one other allele of said gene. 

75. A method of inhibiting growth of a cell comprising the step of: 
administering at least one inhibitor active on an allele of a gene vital for cell 

viability or growth, 

wherein said gene is located on a high frequency LOH chromosomal arm 
region, and wherein said inhibitor is less active on at least one other allele of said 
gene. 

76. A method of inhibiting growth of a cell comprising the step of: 
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administering at least one inhibitor active on an allele of a gene vital for cell 
viability or growth, 

wherein said gene has at least two sequence variances which occur at 
frequences such that at least 10% of a population is heterozygous for said gene, and 
5 wherein said inhibitor is less active on at least one other allele of said gene. 

77. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
10 step of: 

identifying a patient heterozygous for a said gene encoding a product required 
for cell proliferation, 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 



15 



20 



78. The method of claim 77, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 



79. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

25 determining whether cancer cells in said patient have undergone LOH of a 

said gene encoding a product required for cell proliferation, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 
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80. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

5 identifying a patient heterozygous for a said gene encoding a product required 

to maintain inorganic ions and vitamins at levels compatible with cell growth or 
survival, 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

10 

81 . The method of claim 80, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

15 

82. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

20 determining whether cancer cells in said patient have undergone LOH of a 

said gene encoding a product required to maintain inorganic ions and vitamins at 
levels compatible with cell growth or survival, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 

25 

83. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
steps of: 
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identifying a patient heterozygous for a said gene encoding a product required 
to maintain organic compounds at levels compatible with cell growth or survival; 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

5 

84. The method of claim 83, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

10 

85. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

15 determining whether cancer cells in said patient have undergone LOH of a 

said gene encoding a product required to maintain organic compounds at levels 
compatible with cell growth or survival, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 

20 

86. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
steps of: 

25 identifying a patient heterozygous for a said gene encoding a product required 

to maintain cellular proteins at levels compatible with cell growth or survival ; 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 
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87. The method of claim 86, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

88. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

determining whether cancer cells in said patient have undergone LOH of a 
said gene encoding a product required to maintain cellular proteins at levels 
compatible with cell growth or survival , 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 

89. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
steps of: 

identifying a patient heterozygous for a said gene encoding a product required 
to maintain cellular nucleotides at levels compatible with cell growth or survival ; 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

90. The method of claim 89, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 
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91. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

determining whether cancer cells in said patient have undergone LOH of a 
said gene encoding a product required to maintain cellular nucleotides at levels 
compatible with cell growth or survival, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 



92. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
steps of: 

15 identifying a patient hetero2ygous for a said gene encoding a product required 

to maintain the integrity and function of cellular and subcellular structures ; 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

20 93 . The method of claim 91 , further comprising the step of determining whether 

cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

25 94 • A method of identifying a potential patient for treatment with an inhibitor 

active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

determining whether cancer cells in said patient have undergone LOH of a 
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said gene encoding a product required to maintain the integrity and function of 
cellular and subcellular structures, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 

95. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for ceil growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
steps of: 

identifying a patient heterozygous for a said gene located on a high frequency 
LOH chromosomal arm region ; 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

96. The method of claim 95, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

97. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

determining whether cancer cells in said patient have undergone LOH of a 
said gene located on a high frequency LOH chromosomal arm region, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 

98. A method of identifying a potential patient for treatment with an inhibitor 
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active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
steps of: 

identifying a patient heterozygous for a said gene which has at least two 
sequence variances which occur at frequences such that at least 10% of a population 
is heterozygous for said gene; 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

99. The method of claim 98, further comprising the step of determining whether 
cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

100. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a gene vital for cell growth or 
viability, wherein said patient is suffering from a cancer, said method comprising the 
step of: 

determining whether cancer cells in said patient have undergone LOH of a 
said gene which has at least two sequence variances which occur at frequences such 
that at least 10% of a population is heterozygous for said gene, 

wherein if said cells have undergone LOH of said gene, then said patient is 
a potential patient for said treatment. 

101. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 
viability, 

wherein said gene encodes a product required for cell proliferation, 
wherein said portion comprises a sequence variance site, and wherein said probe 
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hybridizes under stringent hybridization conditions to said portion and not to a 
corresponding portion of a second allelic form having at least one different 
nucleotide at said sequence variance site. 

102. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 
viability, 

wherein said gene encodes a product required to maintain inorganic 
ions and vitamins at levels compatible with cell growth or survival, wherein said 
portion comprises a sequence variance site, and wherein said probe hybridizes under 
stringent hybridization conditions to said portion and not to a corresponding portion 
of a second allelic form having at least one different nucleotide at said sequence 
variance site. 

103. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 
viability, 

wherein said gene encodes a product required to maintain organic 
compounds at levels compatible with cell growth or survival, wherein said portion 
comprises a sequence variance site, and wherein said probe hybridizes under 
stringent hybridization conditions to said portion and not to a corresponding portion 
of a second allelic form having at least one different nucleotide at said sequence 
variance site. 

104. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 
viability, 

wherein said gene encodes a product required to maintain cellular 



WO 98/41648 



PCT/US98/05419 



367 232/116 

proteins at levels compatible with cell growth or survival, wherein said portion 
comprises a sequence variance site, and wherein said probe hybridizes under 
stringent hybridization conditions to said portion and not to a corresponding portion 
of a second allelic form having at least one different nucleotide at said sequence 
5 variance site. 



105. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 

10 viability, 

wherein said gene encodes a product required to maintain cellular 
nucleotides at levels compatible with cell growth or survival, wherein said portion 
comprises a sequence variance site, and wherein said probe hybridizes under 
stringent hybridization conditions to said portion and not to a corresponding portion 
15 of a second allelic form having at least one different nucleotide at said sequence 

variance site. 

106. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 

20 viability, 

wherein said gene encodes a product required to maintain the integrity 
and function of cellular and subcellular structures, wherein said portion comprises 
a sequence variance site, and wherein said probe hybridizes under stringent 
hybridization conditions to said portion and not to a corresponding portion of a 
25 second allelic form having at least one different nucleotide at said sequence variance 

site. 



107. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 
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viability, 

wherein said gene is located on a high frequency LOH chromosomal 
arm region, wherein said portion comprises a sequence variance site, and wherein 
said probe hybridizes under stringent hybridization conditions to said portion and not 
5 to a corresponding portion of a second allelic form having at least one different 

nucleotide at said sequence variance site. 

108. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a gene vital for cell growth or 
viability, 

wherein said gene has at least two sequence variances which occur at 
frequences such that at least 10% of a population is heterozygous for said gene, 
wherein said portion comprises a sequence variance site, and wherein said probe 
hybridizes under stringent hybridization conditions to said portion and not to a 
corresponding portion of a second allelic form having at least one different 
nucleotide at said sequence variance site. 

109. The method , inhibitor, pharmaceutical composition, or nucleic acid probe of 
any of claims 1, 13, 21, 29, 37, 53, 69, 77, and 101, wherein said gene is selected 

20 from the group consisting of 14-3-3 Protein TAU, CCNA(G2/Mitotic-Specific Cyclin 

A ), CCNBl(G2/Mitotic-Specific Cyclin Bl), CCND1(G1/S-Specific Cyclin Dl), 
CCND2(G1/S-Specific Cyclin D2), CCND3(G1/S-Specific Cyclin D3), Ceil division 
control protein 16, Cell division cycle 2, Gl to S and G2 to M, Cell division cycle 25 A, 
Cell division cycle 25B, Ceil division cycle 25C, Cell division cycle 27, Cell division- 

25 associated protein BIMB, Cyclin Al(G2/Mitotic-Specific Cyclin Al), Cyclin C 

(Gl/S-Specific Cyclin C), Cyclin Gl(G2/Mitotic-Specific Cyclin G), Cyclin G2 
(G2/Mitotic-Specific Cyclin G), Cyclin H, Cyclin H Assembly, GSPT1(G1 to S 
phase transition 1), Mitotic MAD2 Protein, MRNP7, RANBP1(RAN binding protein 
1), WEE1, Cell Division Protein Kinase 4, CDC28 protein kinase 1, CDC28 protein 
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kinase 2, M-Phase inducer phosphatase 2, M-phase phosphoprotein, mpp6, 
PPPlca(Protein phosphatase 1, catalytic subunit, alpha isoform), STM7-LSB, 
CENP-F kinetochore protein, Centromere autoantigen C, Centromere protein B 
(80kD), Centromere protein E (312kD), CHC1 (Chromosome condensation 1), 
Chromatin assembly factor-I pi 50 subunit, Chromatin assembly factor-I p60 subunit, 
Chromosome segregation gene homolog CAS, HMG1 (High-mobility group 
(nonhistone chromosomal) protein 1), Minichromosome Maintenance (MCM7), 
Mitotic centromere-associated kinesin, RMSA1 (Regulator of mitotic spindle 
assembly 1), and SUPT5h(Chromatin structural protein homolog (SUPT5H)). 

110. The method , inhibitor, pharmaceutical composition, or nucleic acid probe of 
any of claims 2, 14, 22, 30, 39, 55, 70, 80, and 102, wherein said gene is selected 
from the group consisting of PMCA1 (Calcium Pump), PMCA2 (Calcium Pump), 
PMCA3 (Calcium Pump), PMCA4 (Calcium Pump), ATP2bl (Calcium-Transporting 
ATPase Plasma Membrane), ATP2b2 (Calcium-Transporting ATPase Plasma 
Membrane), ATP2b4 (Calcium-Transporting ATPase Plasma Membrane), ATP5b 
(ATP Synthase Beta Chain, Mitochondrial Precursor), Chloride Conductance 
Regulatory Protein ICLN, H-Erg (Potassium Channel Protein EAG), Nuclear 
Chloride Ion Channel Protein (NCC27), SCNlb(Sodium Channel, Voltage-Gated, 
Type I, Beta Polypeptide), Two P-Domain K-h Channel TWK-1, VDAC2 (Voltage- 
Dependent Anion-Selective Channel Protein 2), ATPlbl (Sodium/Potassium- 
Transporting ATPase Beta-1 Chain), ATPlb2 (Sodium/Potassium-Transporting 
ATPase Beta-2 Chain), ATPase, Ca+ + transporting, plasma membrane 4, ATPase, 
Ca+ + transporting, plasma membrane 2, ATPase, Na+/K+ transporting, alpha 1 
polypeptide, ATPase, Na+/K+ transporting, alpha 3 polypeptide, ATPase, 
Na+/K+ transporting, beta 1 polypeptide, ATPase, Na+/K+ transporting, beta 2 
polypeptide, Na+,K+ ATPase, 1 Subunit, Na + ,K+ ATPase, 2 alpha, Na+,K + 
ATPase, 3 beta, SLC9al(Solute carrier family 9 (sodium/hydrogen exchanger)), 
Solute carrier family 4, anion exchanger, member 1, Solute carrier family 4, anion 
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exchanger, member 2, Solute carrier family 9 (sodium/hydrogen exchanger), Passive 
transporters, MaxiK Potassium Channel Beta Subunit, Chloride Channel 2, Chloride 
Channel Protein (CLCN7), TRPC1 (Transient Receptor Potential Channel 1), 
Potassium Channel Kv2.1, ATP5d(ATP synthase, H + transporting, mitochondrial 
Fl complex, delta subunit), ATP5fl(ATP synthase, H+ transporting, mitochondrial 
F0 complex, subunit b), ATP5o(ATP synthase, H-f transporting, mitochondrial Fl 
complex, O subunit), ETFa(Electron-transfer-flavoprotein, alpha polypeptide 
(glutaric aciduria II)), ETFb(Electron-transfer-flavoprotein, beta polypeptide), Nadh- 
ubiquinone oxidoreductase 13 kd-B subunit, Nadh-ubiquinone oxidoreductase 39 kD 
subunit precursor, NADH-Ubiquinone oxidoreductase 75 kD subunit precursor, 
NADH-Ubiquinone oxidoreductase MFWE subunit, NDUFV2(NADH 
dehydrogenase (ubiquinone) flavoprotein 2 (24kD)), Ubiquinol-cytochrome c 
reductase complex 11 kD, ATP Synthase Alpha Chain, NADH dehydrogenase- 
ubiquinone Fe-S protein 8, 23 kDa subunit, Ascorbic Acid (transporter), Folate 
Binding Protein, Folate receptor 1 (adult), Nicotinamide (transporter), Pantothenic 
Acid transporter, Riboflavin (transporter), SCL19A1 (Solute Carrier Family 19, 
Memberl), Solute carrier family 19 (folate transporter), member 1, Thiamine, B6, 
B12 (transporter), ATP7b (Copper-Transporting ATPase 2), Ceruloplasmin 
(ferroxidase), Ceruloplasmin receptor (Copper Transporter), Copper Transport 
Protein HAH1, Molybdenum, Selenium, Tranferrin Receptor (Iron Transporter), 
Zinc Transporter, and mitochondrial import receptor subunit TOM20. 

111. The method , inhibitor, pharmaceutical composition, or nucleic acid probe of 
3, 25, 23, 31, 41, 57, 71, 83, and 103, wherein said gene is selected from the group 
consisting of GLUT1, GLUT2, GLUT3, GLUT4, GLUT5, GLUT6, Solute carrier 
family 5 (sodium/glucose cotransporter), Solute carrier family 2 (facilitated glucose 
transporter), member 2, Solute carrier family 2 (facilitated glucose transporter) 
member 5, Solute carrier family 3 member 1, System b,(Na+ independent), System 
y,(Na+ independent), ATRCl(Catioinc), LEUT(Leucine Transporter), 
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SLClAl(Solute Carrier Family 1, Member 1), Solute carrier family 16 
(monocarboxylic acid transporters), AC01(Aconitase 1), AC02(Aconitase 2, 
mitochondrial), Acyl-Coenzyme A dehydrogenase, C-2 to C-3 short chain, Acyl- 
Coenzyme A dehydrogenase, C-4 to C-12 straight chain, Acyl-Coenzyme A 
dehydrogenase, long chain, Acyl-Coenzyme A dehydrogenase, very long chain, 
aKGD (alpha ketoglutaratedehydrogenase), ALD-a (Aldolase), ALD-b (Aldolase), 
ALD-c (Aldolase), CS (Citrate Synthetase), Dihydrolipoamide S-succinyltransferase, 
DLAT(Dihydrolipoamide S-acetyltransferase (E2 component of pyruvate 
dehydrogenase complex)), DLD(Dihydrolipoamide dehydrogenase (E3 component 
of pyruvate dehydrogenase complex, 2-oxo-glutarate complex, branched chain keto 
acid dehydrogenase complex)), Elk (Oxoglutarate dehydrogenase), E2k 
(Dihydrolipoamide S-succinyltransferase), E3 (Dihydrolipoyl Dehydrogenase), 
EN01(Enolase 1, alpha), EN02(Enolase 2), EN03(Enolase 3), Enolase 2, (gamma, 
neuronal), Enolase 3, (beta, muscle), FH(Fumarate hydratase), G3PDH 
(Glyceraldehyde-3-Phosphate Dehydrogenase), G6PD (Glucose-6-Phosphate 
Dehydrogenase), Glucose-6-phosphate dehydrogenase, HK1 (Hexokinase 1), HK2 
(Hexokinase 2), HK3 (Hexokinase 3), IDHl(Isocitrate dehydrogenase 1 (NADP+), 
soluble) , IDH2(Isocitrate dehydrogenase 2 (NADP + ) , mitochondrial) , 
MDHl(Malate dehydrogenase 1, NAD (soluble)), MDH2(Malate dehydrogenase 1, 
NAD (mitochondrial)), NAD(H)-specific isocitrate dehydrogenase alpha subunit, 
Oxoglutarate dehydrogenase (Hpoamide), PDHB (Pyruvate Dehydrogenase), 
PDHB(Pyruvate dehydrogenase (Hpoamide) beta), PDK4 (Pyruvate dehydrogenase 
kinase, isoenzyme 4), PFKL(Phosphofructokinase), PGI (Phosphoglucoisomerase), 
PGKa (Phosphoglyceromutase), PGKb (Phosphoglyceromutase), PGM1 
(Phosphoglyceromutase), PGM2 (Phosphoglyceromutase), PGM3 
(Phosphoglyceromutase), PGM4 (Phosphoglyceromutase), Phosphofructokinase, 
muscle, Phosphoglucomutase 1, Phosphoglycerate kinase 1, PK1 (Pyruvate Kinase), 
PK2 (Pyruvate Kinase), PK3 (Pyruvate Kinase), Pyruvate dehydrogenase kinase 
isoenzyme 2 (PDK2), Pyruvate kinase, liver, Pyruvate kinase, muscle, 
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SDH 1 (Succinate dehydrogenase, iron sulphur (Ip) subunit), SDH2(Succinate 
dehydrogenase 2, flavoprotein (Fp) subunit), TKT(Transketolase (Wernicke- 
Korsakoff syndrome)), TPI (Trisephosphate Isomerase), Asparagine Synthetase, 
Aminoacylase-1, Aminoacylase-2, ACAC (Acetyl CoA Carboxylase Beta), ACAC 
(Acetyl CoA Carboxylase), ACADSB(Acyl-coA dehydrogenase), Mevalonate kinase, 
Phosphomevalonate kinase, Aspartoacylase, Ornithine decarboxylase 1, Short-acyl- 
CoA dehydrogenase, Medium acyl-CoA dehydrogenase, Long acyl-CoA 
dehydrogenase, Isovalveryl CoA dehydrogenase, 2-methyl branched chain, 
Adenosine Deaminase, Purine-nucleoside phosphorylase, Guanine Deaminase, 
Xanthine Oxidase, ITM1 (Integral Transmembrane Protein), GFPT (Glutamine- 
Fructose-6-Phosphate Transaminase), Heparan, Polypeptide N-Acetyltransferase, 
AC AA( Acetyl-Coenzyme A acyltransferase), Lysophosphatidic acid acyltransferase- 
alpha, Lysophosphatidic acid acyltransferase-beta, FNTa (Farnesyltransferase Alpha 
Subunit), FNTb (Farnesyltransferase Beta Subunit), NMT1 (N-myristoyltransferase), 
Calcineurin A, Calcineurin B, Calreticulin Precursor, Phosphatase 2b, 
PPP3ca(Protein phosphatase 3 , catalytic subunit), SNK Interacting 2-28(Calcineurin 
B Subunit), Protein Kinase C, PRKCA(Protein kinase C, alpha), PRKCB1 (Protein 
kinase C, beta 1), PRKCD(Protein kinase C, delta), PRKCM(Protein kinase C, mu), 
PRKCQ(Protein kinase C-theta), PRKCSH(Protein kinase C substrate 80K-H), 
Geranylgeranyl, Geranylgeranyltransferase (Type I Beta), GGTB 
(Geranylgeranyltransferase), Geranylgeranyltransferase (Type II Beta-Subunit), Gdp 
Dissociation Inhibitors, GDI Alpha (RAB GDP Dissociation Inhibitor Alpha), and 
Rab Gdp (RAB GDP Dissociation Inhibitor Alpha). 

112. The method, inhibitor, pharmaceutical composition, or nucleic acid probe of 
any of claims 4, 16, 24, 32, 43, 59, 72, 86, and 104, wherein said gene is selected 
from the group consisting of GOT(Glutamic-oxaloacetic transaminase 2), 
GOTl(Glutamic-oxaloacetic transaminase 1), PYCS(Pyrroline-5-carboxylate 
synthetase), Tyrosine aminotransferase, AARS, CARS, DARS, EPRS, FARS, 
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GARS, HARS, IARS, KARS, LARS, MARS, NARS, QARS , RARS, SARS, 
TARS, VARS, WRS, YARS, Ribosomal Protein Lll, Ribosomal Protein L12, 
Ribosomal Protein L17, Ribosomal Protein L18, Ribosomal Protein LI 8a, 
Ribosomal Protein L19, Ribosomal Protein L21, Ribosomal Protein L22, Ribosomal 

5 Protein L23, Ribosomal Protein L23a, Ribosomal Protein L25, Ribosomal Protein 

L26, Ribosomal Protein L27, Ribosomal Protein L27a, Ribosomal Protein L28, 
Ribosomal Protein L29, Ribosomal Protein L30, Ribosomal Protein L31, Ribosomal 
Protein L32, Ribosomal Protein L35, Ribosomal Protein L35a, Ribosomal Protein 
L36a, Ribosomal Protein L39, Ribosomal Protein L4, Ribosomal Protein L41, 

0 Ribosomal Protein L44, Ribosomal Protein L6, Ribosomal Protein L7, Ribosomal 

Protein L7a, Ribosomal Protein L8, Ribosomal Protein L9, Ribosomal Protein PI, 
Ribosomal Protein S10, Ribosomal Protein Sll, Ribosomal Protein S13, Ribosomal 
Protein S14, Ribosomal Protein S15, Ribosomal Protein S15A, Ribosomal Protein 
S16, Ribosomal Protein S17, Ribosomal Protein S17A, Ribosomal Protein S17B, 

5 Ribosomal Protein S18, Ribosomal Protein S20, Ribosomal Protein S20A, 

Ribosomal Protein S20B, Ribosomal Protein S21 , Ribosomal Protein S23, Ribosomal 
Protein S25, Ribosomal Protein S26, Ribosomal Protein S28, Ribosomal Protein 
S29, Ribosomal Protein S3, Ribosomal Protein S3 A, Ribosomal Protein S4, 
Ribosomal Protein S4X, Ribosomal Protein S4Y, Ribosomal Protein S5, Ribosomal 

0 Protein S6, Ribosomal Protein S7, Ribosomal Protein S8, Ribosomal Protein S9, 

Initiation of polypeptide polymerization, eIF-2 (Eukaryotic initiation factor), eIF-2- 
associated p67(Eukaryotic initiation factor), eIF-2A(Eukaryotic initiation factor), 
eIF-2Alpha(Eukaryotic initiation factor), eIF-2B(Eukaryotic initiation factor), elF- 
2B-Gamma(Eukaryotic initiation factor), eIF-2Beta(Eukaryotic initiation factor), elF- 

5 3 pllO(Eukaryotic initiation factor), eIF-3 p36(Eukaryotic initiation factor), elF- 

4A(Eukaryotic initiation factor), eIF-4C(Eukaryotic initiation factor), elF- 
4E(Eukaryotic initiation factor), eIF-4Gamma(Eukaryotic initiation factor), elF- 
5 (Eukaryotic initiation factor), eIF-5A, Eukaryotic peptide chain release factor 
subunit 1, P97(Eukaryotic initiation factor), eEFlA2(Eukaryotic elongation factor), 
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eEFlD(Eukaryotic elongation factor), eEF2(Eukaryotic elongation factor), eIF4A2 
(Eukaryotic initiation factor), KIAA0031 (Elongation factor 2), KIAA0219(Putative 
translationa! activator C18G6.05C), Factor 1-Alpha 2(Eukaryotic translation 
elongation factor 1 alpha 2), Cis-Trans Isoraerase, DNAj Protein Homolog 1, DNAj 
Protein Homolog 2, DNAJ Protein homolog HSJ1, T-Complex, 
Aspartylglucosaminidase, T-Complex 1, Alpha, T-Complex 1, Epsilon, T-Complex 
1, Gamma, T-Complex 1, Theta, T-Complex 1, Zeta, 26S Protease regulatory 
subunit 4, Alpha-2-Macroglobulin, Calpain 1, Large, CLPP(ATP-Dependent CLP 
protease proteolytic subunit), KIAA0123 (Mitochondrial processing peptidase alpha 
subunit), MMP7, Proteasome Beta 6, Proteasome Beta 7, Proteasome C13, 
Proteasome C2, Proteasome C7-1, Proteasome inhibitor hPI31 subunit, Proteasome 
PI 12, Proteasome P27, Proteasome P55, Enzyme E2-17 Kd(Cyclin-selective 
ubiquitin carrier protein), ISOT-3(Ubiquitin carboxyl-terminal hydrolase T ), ORF 
(Ubiquitin carboxyl-terminal hydrolase 14), PGP(Ubiquitin carboxyl-terminal 
hydrolase isozyme LI), UBA52(Ubiquitin A-52 residue ribosomal protein fusion 
product 1), Ubiquitin carboxyl-terminal hydrolase 3, Ubiquitin carboxyl-terminal 
hydrolase isozyme L3, Ubiquitin carboxyl-terminal hydrolase T, Ubiquitin carrier 
protein (E2-EPF), Ubiquitin fusion-degradation protein (UFD1L), Ubiquitin 
Hydrolase, Ubiquitin-conjugating enzyme E2I, SEC23(Protein transport protein 
SEC23), SEC23A(Protein transport protein SEC23), SEC7(Protein transport protein 
SEC7), SEC61 (Beta Subunit), and LDLR (LDL receptor). 

113. The method, inhibitor, pharmaceutical composition, or nucleic acid probe of 
any of claims 5, 17, 25, 33, 45, 73, 89, and 105, wherein said gene is selected from 
the group consisting of Adenylate Kinase-2, Adenylosuccinate synthetase, 
Adenylosuccinate Lyase, DPRT (ADP-Ribosyltransferase), ADSL (Adenylosuccinate 
lyase/ AMP synthetase), ADSS (Adenylosuccinate Synthetase), CAD PROTEIN, 
CTP Synthetase, CTPS(CTP synthetase), Cytidine Triphosphate Synthetase, GARS 
(Phosphoribosylglycinamide synthetase), GART (Phosphoribosylglycinamide 
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formyltransferase), GART(Phosphoribosylglycinamide formyltransferase, 
phosphoribosylglycinamide synthetase, phosphoribosylaminoimidazole synthetase), 
GMP Synthetase, IMP Cyclohydrolase, IMP dehydrogenase, IMPDH1(IMP (inosine 
monophosphate) dehydrogenase 1), IMPDH2(IMP (inosine monophosphate) 
dehydrogenase 2), Phosphoribosyl diphosphotransferase, 
Phosphoribosylaminoimidazolecarboxamide formyltransferase, 
Phosphoribosylformylglycinamide synthetase, Phosphoribosylglycinamide 
carboxylase, Phosphoribosylglycinamide-succinocarboxamide synthetase, PPAT 
(Amidophoribosyltransferase), PPAT(Phosphoribosyl pyrophosphate 
amidotransferase), RibonucleosideKiiphosphate reductase Ml chain, Ribonucleoside- 
diphosphate reductase M2 chain, Thymidine Kinase, Thymidylate Synthase, 
UMK(Uridine kinase), UMPK (Uridine monophosphate kinase), UMPS(Uridine 
monophosphate synthetase (orotate phosphoribosyl transferase and orotidine-5'- 
decarboxylase)), Uridine Phosphorylase, DNA Origin Recognition Complex, ORC1, 
ORC2, ORC3, ORC4, ORC5, ORC6, ORC Regulators, CDC6, CDC7, CDC1, 
DNA Polymerization, DNA Polymerases, Adprt (NAD(+) ADP- 
Ribosyltransferase), DNA Polymerase Alpha-Subunit, DNA Polymerase Delta, 
POLa(DNA Polymerase Alpha/Primase Associated Subunit), POLb(DNA 
Polymerase Beta Subunit), POLdl (Polymerase (DNA directed), Delta 1, Catalytic 
Subunit), POLd2(Polymerase (DNA directed), Delta 2), POLE(Polymerase (DNA 
directed)), POLg (DNA Polymerase Gamma Subunit), Terminal Transferase (DNA 
Nucleotidylexotransferase), Activator 1 36 Kd, CDC46 (DNA Replication Licensing 
Factor), CDC47 (DNA Replication Licensing Factor CDC47), DNA Topoisomerase 
III, DRAP1 (DNA Replication Licensing Factor MCM3), KIAA0030 Gene (Cell 
Division Control Protein 19), KIAA0083 Gene (DNA Replication Helicase DNA2 
), MCM3 (DNA Replication Licensing Factor MCM3), PCNA (Proliferating Cell 
Nuclear Antigen), PRTM1 (DNA Primase 49 kD Subunit ), PRIM2 (DNA Primase), 
PRIM2a (DNA Primase 58 kD Subunit), PRIM2b (DNA Primase), RECa 
(Replication Protein A 14 kD Subunit), RFC1 (Replication Factor C (activator 1) 1), 



WO 98/41648 



PCT/US98/05419 



376 232/116 

RFC2 (Replication Factor C 2), RFC3 (Replication Factor C (activator 1) 3), RFC4 
(Replication Factor C, 37-kD subunit), RFC5 (Replication Factor C), RPA1 
(Replication protein Al (70kD)), RPA2 (Replication protein A2 (32kD)), RPA3 
(Replication protein A3 (14kD)), TOPI (DNA Topoisomerase I), TOP2a 
(Topoisomerase (DNA) II Alpha (170kD)), TOP2b (Topoisomerase (DNA) II Beta 
(180kD)), CHL1(CHL1-Related Helicase), DNA Helicase II, Mi-2(Chromodomain- 
Helicase- DNA-Binding Protein CHD-1 ), RECQL (ATP-Dependent DNA Helicase 
Ql), Smbp2 (DNA-Binding Protein SMUBP-2), H1(0) (Histone H5A), Histone Hid, 
Histone Hlx, Histone H2a.l, Histone H2a.2, Histone H2b.l, Histone H4, SLBP 
(Histone Hairpin-Binding Protein), TATA-binding Complex, Small Nuclear RNA- 
Activating Complex, Polypeptide 1, 43KD (SNAPC1), Small Nuclear RNA- 
Activating Complex, Polypeptide 2, (SNAPC2), Small Nuclear RNA_Activating 
Complex, Polypeptide 3, 50KD (SNAPC3), TAF2D(TBP-associated factor), 
TAFII100(TBP-associated factor), TAFni30(TBP-associated factor), TAFH20(TBP- 
associated factor), TAFII250(TBP-associated factor), TAFII28(TBP-associated 
factor), TAFII30(TBP-associated factor), TAFn32(TBP-associated factor), 
TAFII40(TBP-associated factor), TAFII55(TBP-associated factor), TAFII80(TBP- 
associated factor), TBP(TATA Binding Protein), TMF1 (TATA Element Modulatory 
Factor 1), RPB 7.0, RPB 7.6, RPB 17, RPB 14.4, RNA polymerase I subunit 
hRPA39, 13.6 Kd Polypeptide (DNA-Directed RNA Polymerase II 13.6 kD 
Polypeptide), POLR2C(RNA polymerase n, polypeptide C (33kD)), Polypeptide A 
(220kd), RNA Polymerase II 23k, RNA polymerase II holoenzyme component 
(SRB7), RNA polymerase II subunit (hsRPBlO), RNA polymerase II subunit 
(hsRPB8), RNA polymerase II subunit hsRPB4, RNA polymerase II subunit 
hsRPB7, RNA Polymerase II Subunit(DNA- Directed RNA Polymerases I, II, and 
III 7.3 kD polypeptide), TCEBlL(Transcription elongation factor B (SHI), 
polypeptide l-like), RNA polymerase III subunit (RPC39), RNA polymerase III 
subunit (RPC62), Elongation Factor 1-Beta, Elongation Factor S-II, TCEA (HOkD), 
TCEB1, TCEB (18kD), TCEB1L, TCEB3, TCEC (15kDa), TFHS (Transcription 
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Elongation Factor IIS), E2F1 (E2F Transcription Factor), TFAP2A (Transcription 
Factor A2 Alpha), TFCP2 (Transcription Factor CP2), TFC12 (Transcription Factor 
12), PRKDC (Protein Kinase, DNA activated catalytic subunit), SUPT6H, TFIIA 
gamma subunit, TFIIA delta, TFIIB related factor hBRF (HBRF), TFIIE Alpha 
Subunit, TFIIE Beta Subunit, TFIIF, Beta Subunit, GTF2F1 (TFIIF), GTF2F2 
(TFIIF), General Transcription Factor III A, TFIIH(52 kD subunit of transcription 
factor), TFHH(p89), TFIIH(p80), TFIIH(p62), TFIIH(p44), TFIIH(p34), 
Transcription Factor IIf(General transcription factor IIF, polypeptide 1 (74kD 
subunit))Transcription Factor IIf(General transcription factor IIF, polypeptide 1 
(74kD subunit)), BTF 62 kDSubunit (Basic transcription factor 62 kD subunit), 
CAMP-dependent transcription factor ATF-4, CCAAT box-binding transcription 
factor 1, CRM 1 (Negative regulator CRM1), Cyclic- AMP-dependent transcription 
factor ATF-1, GABPA(GA-binding protein transcription factor, alpha subunit 
(60kD)), ISGF-3 (Signal transducer and activator of transcription 1 -alpha/beta), 
NFIX(Nuclear factor I/X (CCAAT-binding transcription factor)), NFYA(Nuclear 
transcription factor Y, alpha), NTF97(Nuclear factor p97), Nuclear factor I-B2 
(NFIB2), Nuclear factor NF45, Nuclear factor NF90, POU2Fl(POU domain, class 
2, transcription factor 1), Sp2 transcription factor, TCF12(Transcription factor 12 
(HTF4, helix-loop-helix transcription factors 4)), TCF3(Transcription factor 3 (E2A 
immunoglobulin enhancer binding factors E12/E47)), TCF6L1 (Transcription factor 
6-like 1), TF P65 (Transcription factor p65), TFCOUP2(Transcription factor COUP 
2 (a.k.a. ARP1)), Transcription factor IL-4 Stat, Transcription Factor S-II 
(Transcription factor S-H-related protein), Transcription factor StatSb, Transcription 
Factor, Transcription factor (CBFB), 9G8 Splicing Factor (Pre-mRNA Splicing 
factor SRP20), CC 1.3 (Splicing factor (CC1.3)), HnRNP F protein, 
HNRPA2B 1 (Heterogeneous nuclear ribonucleoproteins A2/B 1) , 
HNRPG(Heterogeneous nuclear ribonucleoprotein G), HNRPK(Heterogeneous 
nuclear ribonucleoprotein K), Pre-mRNA splicing factor helicase, Pre-mRNA 
splicing factor SF2, P33 subunit, Pre-mRNA splicing factor SRP20, Pre-mRNA 
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splicing factor SRP75, PRP4(Serine/threonine-protein kinase PRP4), PTB- Associated 
Splicing Factor, Ribonucleoprotein A\ Ribonucleoprotein Al, Ribonucleoprotein 
C1/C2, RNP Protein, L (Heterogeneous nuclear ribonucleoprotein L), RNP-Specific 
C(U1 small nuclear ribonucleoprotein C ), SAP 145(Spliceosome associated protein 
5 ), SAP 61(SpIicesomal protein), SC35(Splicing factor), SF3al20, SFRS2(Splicing 

factor, arginine/serine-rich 2), SFRS5(Splicing factor, arginine/serine-rich 5), 
SFRS7(Splicing factor, arginine/serine-rich 7), Small nuclear ribonucleoprotein SM 
Dl, SnRNP core protein Sm D2, SnRNP core protein Sm D3, SNRP70(U1 snRNP 
70K protein), SNRPB(Small nuclear ribonucleoprotein polypeptides B and Bl), 

10 SNRPE(Small nuclear ribonucleoprotein polypeptide E), SNRPN(Small nuclear 

ribonucleoprotein polypeptide N), Splicing factor SF3al20, Splicing factor U2AF 
35 kD subunit, Splicing factor U2AF 65 kD subunit, SRP30C(Pre-mRNA splicing 
factor SF2, p33 subunit), SRP55-2(Pre-mRNA splicing factor SRP75), Transcription 
factor BTEB, Transcription initiation factor TFIID 250 kD subunit, Cleavage and 

15 polyadenylation specificity factor, Cleavage stimulation factor, 3' pre-RNA, subunit 

1, 50kD, Cleavage stimulation factor, 3' pre-RNA, subunit 3, 77kD, HNRNP 
Methyltransferase, PABPLl(Poly(A)-binding protein-like 1), Pap mRNA(Poly(A) 
Polymerase), RNA unwinding, RNA Helicase, GU Protein (ATP-Dependent RNA 
helicase dead), KIAA0224 Gene(Putative ATP-dependent RNA helicase), RNA 

20 Helicase A, RNA Helicase PUO, and Stel3(Nuclear RNA Helicase). 



1 14. The method, inhibitor, pharmaceutical composition, or nucleic acid probe of 
any of claims 6, 18, 26, 34, 47, 63, 92, and 106, wherein said gene is selected from 
the group consisting of AP47(Clathrin Coat Assembly AP47), AP50(Clathrin Coat 
25 Assembly Protein AP50), Cell Surface Protein (Clathrin Heavy Polypeptide-Like 

Protein ), Cltb(Clathrin Light Chain B), Cite (Clathrin Heavy Chain), Adenylate 
Cyclase, Adenylate Cyclase, Adenylate Cyclase, II, Adenylate Cyclase,IV, Complex 
I, MTND1 (Subunit ND1), MTND2 (Subunit ND2), MTND3 (Subunit ND3), 
MTND4 (Subunit ND4), MTND4L (Subunit ND4L), MTND5 (Subunit ND5), 
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MTND6 (Subunit ND6), Complex n, Complex HI, Cytochrome b subunit, Complex 
IV, COl (Cytochrome c Oxidase Subunit I), C02 (Cytochrome c Oxidase Subunit 
2), C03 (Cytochrome c Oxidase Subunit 3), Complex V, ATP Synthase Subunit 
ATPase 6, Kinesin Heavy Chain, Kinesin Light Chain, Syntaxin la, Syntaxin lb, 
Syntaxin 3, Syntaxin 5a, Syntaxin 7, CANX (Calnexin), ER Lumen Protein 1, ER 
Lumen Protein 2, Ribophorin I, Ribophorin II, Signal recognition particle receptor, 
SRP Protein, TIM17 preprotein translocase, Golgin-245, TGN46 (Trans-Golgi 
Network Integral Membrane Protein TGN38 Precursor), Beta-Cop, Coatomer Beta' 
Subunit, Coatomer Delta Subunit, Gp36b Glycoprotein (Vesicular integral-membrane 
protein VIP36 precursor), Homologue of yeast sec7, Protein transport protein SEC13 
(Chromosome 3p25), SEC14 (S. Cerevisiae), Synaptic vesicle membrane protein 
VAT-1, Synaptobrevin-3, Synaptotagmin I, Transmembrane(COP-coated vesicle 
membrane protein p24 precursor), Vacuolar-Type (Clathrin-coated vesicle/synaptic 
vesicle proton pump 116 kd subunit ), 140 kD Nucleolar phosphoprotein, 
Autoantigen p542, Export protein Rael (RAE1), Heterogeneous nuclear 
ribonucleoprotein Al , Nuclear pore complex protein hnupl53, Nuclear pore complex 
protein NUP214, Nuclear pore glycoprotein p62, Nuclear Transport Factor 2, 
Nucleoporin 98 (NUP98), NUP88, Ribonucleoprotein A, Ribonucleoprotein B M , 
Karyopherin, Importin Alpha Subunit, TRN (Transportin), Actin, Beta-Centractin, 
Capping Protein Alpha, CFL1 (Cofilin, Non-Muscle Isoform), Desmin, Dystrophin, 
Gelsolin, hOGGl(Myosin Light Chain Kinase), IC Heavy Chain, Itga2 (Integrin, 
Alpha 2 (CD49B, alpha 2 Subunit of VLA-2 receptor)), Itga3 (Integrin Alpha-3 
Precursor), Keratin 19, Keratin, Type n, Lamin A, LBR(Lamin B Receptor), Light 
Chain Alkali, MacMarcks mRNA, MAPla (Microtubule-Associated Protein 1A), 
MAP2(Microtubule-Associated Protein 2), MEGl(Protein-Tyrosine Phosphatase 
MEG1), Microtubule-Associated Protein TAU, Suppressor Of Tubulin STU2, TUBg 
(Tubulin Gamma Chain), Tubulin Alphas Chain, USHlb (Myosin II Heavy Chain), 
Villin, Villin 2 (Ezrin), Actin Depolymerizing, Capping (Actin Filament), 
MYH9(Myosin, Heavy Polypeptide 9, Non-Muscle), MYL5(Myosin Regulatory 
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Light Chain 2), Myosin Heavy Chain 95F, Myosin Heavy Chain IB, Myosin IB, 
Sh3pl7(Myosin IC Heavy Chain), Sh3pl8(Myosin IC Heavy Chain), 
KIAA0059(Dematin:Actin-Bundling Protein), TTN (Titin:Myosin Light Chain 
Kinase), ATP6c(Vacuolar H4- ATPase proton channel subunit), ATP6al (ATPase, 
H+ Transporting, Lysosomal (Vacuolar Proton Pump), Alpha Polypeptide, 70kD), 
ATP6M (ATPase, H+ transporting, lysosomal (vacuolar proton pump), beta 
polypeptide, 56/58kD), ATP6d(ATPase, H+ transporting, lysosomal (vacuolar 
proton pump) 42kD), ATP6e( ATPase, H + transporting, lysosomal (vacuolar proton 
pump) 31kD), ATPase, H+ transporting, lysosomal (vacuolar proton pump) 31kD, 
and Superoxide Dismutase. 

115. A method for identifying an inhibitor potentially useful for treatment of cancer, 
wherein said inhibitor is active on a conditionally essential gene, and wherein said gene 
is subject to loss of heterozygosity in a cancer, said method comprising the steps of: 

(a) determining at least two alleles of a said gene; 

(b) testing a potential allele specific inhibitor to determine whether said 
potential allele specific inhibitor is active on at least one but less than all of said alleles; 

wherein inhibition of expression of at least one but less than all of said alleles 
or reduction of the level of activity of a product of at least one but less than all of said 
alleles in the presence of said potential allele specific inhibitor is indicative that said 
potential allele specific inhibitor is a said inhibitor. 

116. An inhibitor potentially useful for treatment of cancer, wherein said inhibitor 
is active on an allelic form of a conditionally essential gene, said gene has at least 
two alternative alleles in a population, and 

wherein said inhibitor targets at least one but less than all of said alternative 

alleles. 

117. A pharmaceutical composition, comprising 
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at least one allele specific inhibitor targeting at least one but less than all 
allelic forms of a conditionally essential gene in a population; and 
a pharmaceutically acceptable carrier or excipient. 

5 1 18. A method for producing an inhibitor potentially useful for cancer treatment, 

wherein said inhibitor is active on at least one but less than all alternative alleles of 
a conditionally essential gene having at least two alternative alleles, comprising the 
steps of: 

(a) identifying a conditionally essential gene that has alternative allelic forms 
10 in a noncancerous cell, wherein one of said alternative allelic forms is deleted in a 

cancer cell; 

(b) screening to identify an inhibitor which inhibits said at least one but less 
than all of said at least two alternative alleles; and 

(c) synthesizing said inhibitor in an amount sufficient to produce a therapeutic 
15 effect when administered to a patient suffering from a cancer in whom cancerous 

cells have only an allele of said gene inhibited by said inhibitor and in whom normal 
cells are heterozygous for said gene and contain an allelic form not inhibited by said 
inhibitor. 



20 119. A method for preventing the development of cancer in a patient having a 

precancerous condition, comprising the steps of: 

a. subjecting cells of said precancerous condition to an altered condition such 
that a first conditionally essential becomes essential; 

b. administering to said patient a therapeutic amount of a first allele specific 
25 inhibitor targeted to an allele of said first conditionally essential gene present in cells 

of said precancerous condition, wherein the normal somatic cells of said patient are 
heterozygous for said first gene, said inhibitor is active on at least one but less than 
all allelic forms of said gene present in a population and targets only one allelic form 
present in said normal somatic cells; and 
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wherein cells of said precancerous condition have undergone LOH of said 
first gene. 

120. The method of claim 119, wherein the cells of said precancerous condition 
5 are not clonal from a single cell, further comprising the step of: 

c. serially administering to said patient at least one additional allele specific 
inhibitor, wherein each of said at least one additional allele specific inhibitors targets 
a different allele of a conditionally essential gene or an essential gene than is targeted 
by said first allele specific inhibitor, wherein said different allele may be a different 
10 allele of said first gene or an allele of a different gene, and wherein said patient is 

heterozygous for each targeted gene and each targeted gene has undergone LOH in 
cells of said precancerous condition. 

121 . A method for treating a patient suffering from a cancer, wherein said patient 
15 is heterozygous for a conditionally essential gene, comprising the steps of: 

a) subjecting cells of said cancer to altered conditions such that said gene 
is essential; and 

administering a therapeutic amount of an allele specific inhibitor active on at 
least one but less than all allelic forms of said gene present in a population, 
20 wherein said allele specific inhibitor inhibits only one allelic form of said 

gene present in said patient, and said only one allelic form of said gene is present in 
cancer cells in said patient. 

122. The method of claim 121, further comprising the steps of: 

25 (a) determining whether non-cancerous cells of said patient are 

heterozygous for a particular conditionally essential gene; or 

(b) determining whether cancerous cells of said patient have only one 
allele of said particular gene; or 

(c) both (a) and (b). 
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123. A method of inhibiting growth of a cell comprising the steps of: 

a) subjecting said cell to conditions such that said gene is essential; and 

b) administering at least one inhibitor active on an allele of said 
conditionally essential gene, 

5 wherein said inhibitor is less active on at least one other allele of said gene. 

124. A method of identifying a potential patient for treatment with an inhibitor 
active on at least one but less than all alleles of a conditionally essential gene, 
wherein said patient is suffering from a cancer, said method comprising the step of: 

10 identifying a patient heterozygous for a said gene, 

wherein if said patient is heterozygous for said gene, then said patient is a 
potential patient for said treatment. 

125. The method of claim 124, further comprising the step of determining whether 
15 cancer cells in said patient contain only a single allele of said gene, 

wherein if said cancer cells contain only a single allele of said gene, then said 
patient is a potential patient for said treatment. 

126. A method of identifying a potential patient for treatment with an inhibitor 
20 active on at least one but less than all alleles of a conditionally essential gene, 

wherein said patient is suffering from a cancer, said method comprising the step of: 
determining whether cancer cells in said patient have undergone LOH of a 
said gene, 

wherein if said cells have undergone LOH of said gene, then said patient is 
25 a potential patient for said treatment. 

126. A nucleic acid probe at least 12 nucleotides in length which is perfectly 
complementary to a portion of a first allelic form of a conditionally essential gene, 
wherein said portion comprises a sequence variance site, and wherein 
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said probe hybridizes under stringent hybridization conditions to said portion and not 
to a corresponding portion of a second allelic form having at least one different 
nucleotide at said sequence variance site. 

5 127. A method for selecting a patient for treatment with an antiproliferative 

treatment, comprising the steps of: 

a) determining whether normal somatic cells in a potential patient are 
heterozygous for an essential or conditionally essential gene, wherein a first allelic 
form of said gene is more active than a second allelic form, and wherein a reduction 

10 in the activity of said gene in a cell increases the sensitivity of said cell to a said 

antiproliferative treatment; and 

b) determining whether cancer cells of said patient have only said second 
allelic form of said gene, 

wherein if said somatic cells are heterozygous and said cancer cells have only 
15 said second allelic form, it is indicative that said patient is suitable for treatment with 

said antiproliferative treatment. 



128. A method for selecting an antiproliferative treatment for a patient suffering 
from a cancer, comprising the steps of: 

20 a) determining whether normal somatic cells in a potential patient are 

heterozygous for an essential or conditionally essential gene which reduces the 
sensitivity of cells to an antiproliferative treatment, wherein a first allelic form of said 
gene is more active than a second allelic form, and wherein a reduction in the activity 
of said gene in a cell increases the sensitivity of said cell to a said antiproliferative 

25 treatment; and 

b) determining whether cancer cells of said patient have only said second 
allelic form of said gene, 

wherein if said somatic cells are heterozygous for said gene and said cancer 
cells have only said second allelic form, it is indicative that said antiproliferative 
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treatment is suitable for said patient. 

1 29. The method of any of claims 1 1 5-1 29, wherein said gene is selected from the 
group consisting of: 

galactose- 1 -phosphate uridyltransferase, galactose kinase, UDP galactose-4- 
epimerase, methionine synthase, asparagine synthase, glutamine synthetase, multidrug 
resistance gne/Pglycoprotein, multidrug resistance associated proteins 1-5, bleomycin 
hydrolase, dihydropyrimidine dehydrogenase, P-ureidopropoinase, P-alanine 
synthetase, cytidine deaminase, thiopurine methyltransferase, CYP1A1, CYP1A2, 
CYP2A6, CYP2A7, CYP2B6, CYP2B7, CYP2C8, CYP2C9, CYP2C17, CYP2C18, 
CYP2C19, CYP2D6, CYP2E1, CYP2F1, CYP3A3, CYP3A4, CYP3A5, CYP3A7, 
CYP4B1, CYP7, CYP11, CYP17, CYP19, CYP21, CYP27, glutathione transferase 
alpha, glutathione transferase theta, glutathione transferae mu, glutathione transferase 
pi, methylguanine methyltransferase, 3-alkylguanine alkyltransferase, 3-methyladenine 
DNA glucosylase, DNA dependent protein kinase, catalytic subunit of DNA-PK, DNA 
binding subunit of DNA-PK Ku-70 or Ku-80 subunit, KARP-1, Poly(ADP-ribose) 
polymerase, Fanconi Anemia genes A, B, C, D, E, F, G, and H, ERCC-1, 
ERCC2/XPD, ERCC3/XPB, ERCC4, ERCC5, ERCC6, XPA, XPC, XPE, HHR23A, 
HHR23B, uracil glycosylase, 3-methyl adenine DNA glycosylase, NF-kappa B, 
XRCC4, XRCC5/Ku80, XRCC6, XRCC7, glutathione-X-transferase, I-kappa B alpha, 
HSP70, HSP27, and 9-oxoguanine DNA glycosylase. 

131. A method for identifying a potential patient undergoing transplantation for 
treatment with an inhibitor active on at least one but less than all alleles of an 
essential gene, comprising the step of: 

identifying a patient undergoing an allogenic bone marrow transplantation in 
which the donor tissue contains at least one alternative allele of an essential gene 
different from the alleles in somatic cells of said patient. 
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132. The method of claim 131, wherein said donor or said recipient is 
homozygous for an alternative allelic form of an essential gene that is not present in 
the other of said donor or said recipient. 

5 1 33 . A method for treating graft versus host disease in a patient receiving allogenic 

bone marrow transplantation, said method comprising the step of 

administering to said patient at least one allele specific inhibitor specific for 
at least one but less than all of the allelic forms of an essential gene in a population, 
wherein said inhibitor inhibits stimulation of the donor immune system, and cells of 

10 the said patient comprise an allelic form of said gene not present in the donor bone 

marrow. 



134. The method of claim 133, wherein said allele specific inhibitor is selected by 
identifying at least one alternative alleles of an essential gene present in the donor 

15 tissues but absent in the normal somatic cells of said patient; and 

selecting a said inhibitor active on a said alternative allele of an essential gene 
present in said donor tissues but absent in the normal somatic cells of said patient. 

135. The method of claim 134, wherein said at least one inhibitor recognizes both 
20 alleles of said essential gene that are present in said donor, but not both alleles of 

said gene that are present in said patient. 

136. A method for enhancing engraftment of an allogenic bone marrow transplant, 
comprising the step of administering to a patient receiving said transplant an allele 

25 specific inhibitor which kills or suppresses the patient's bone marrow but not the 

donor bone marrow, thereby providing space for engraftment of the donor cells 
within the marrow cavity. 

137. The method of claim 136, wherein the allele specific inhibitor is selected by 



WO 98/41648 



PCT/US98/05419 



387 232/116 

identifying alternative alleles of an essential gene that are present in the recipient but 
not the donor marrow. 



138. The method of claim 137, wherein said allele specific inhibitor recognizes 
both allelic forms of the essential gene that are present in the recipient, but not both 
allelic forms of the same gene that are present in the recipient. 

139. A method for treating or preventing chimerism in allogenic bone marrow 
transplantation, comprising 

selectively killing or suppressing proliferation of the patient's own cells 
without toxicity to the donor cells by 

administering to a patient receiving said transplantation at least one allele 
specific inhibitor active on at least one but less than all alternative alleles of a gene 
vital for cell growth or viability, wherein said inhibitor targets the allelic form or 
15 forms of a gene in bone marrow of said patient but does not target at least one allelic 

form of said gene in the donor bone marrow. 



10 



140. A method for treating cancer in a patient receiving allogenic or autologous 
transplantation, comprising the step of 

20 administering to said patient at least one allele specific inhibitor which kills 

or inhibits the growth of cancer cells without toxicity to the transplanted marrow. 

141. The method of claim 141, wherein said transplantation is autologous 
transplantation and said at least one allele specific inhibitor is selected to be active 

25 on the allele of an essential gene remaining in the cancer cells due to LOH in patients 

whose normal somatic cells are heterozygous for said essential gene, but not on the 
alternative allele of said gene present in said normal somatic cells, 

whereby said administration enables continuing therapy of cancer without 
suppression of the transplanted marrow. 
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142. The method of claim 140, wherein said transplantation is allogenic 
transplantation and said allele specific inhibitor recognizes both alleles of said 
essential gene that are present in the recipient, but not both forms of the said gene 
that are present in said patient. 

143. A method for eliminating malignant cells from transplanted marrow during 
autologous transplantation of a patient heterozygous for an essential gene, 
comprising 

contacting cells from harvested autologous bone marrow ex vivo with at least 
one allele specific inhibitor active on at least one but less than all alternative alleles 
of said essential gene, wherein said inhibitor targets an allelic form of said gene 
present in cancer cells of said patient but does not target an alternative allele of said 
gene present in normal cells from said autologous bone marrow, 

wherein said gene has undergone LOH in cancer cells of said patient. 

144. The method of claim 143, wherein said autologous bone marrow is harvested 
from said patient prior to high dose radiation or chemotherapy. 

145. The method of claim 143, further comprising the steps of: 

a. identifying one alternative allele of an essential gene remaining in the 
cancer cell due to LOH in patients who are heterologous with two different 
alternative forms of the essential gene in normal cells of the autologous bone 
marrow; 

b. cultivating said autologous bone marrow ex vivo in the presence of an 
allele specific inhibitor that inhibits the allele that is present in the cancer cells, but 
not the heterologous allele that is present in the normal bone marrow. . 

146. The method of claim 143, wherein said autologous bone marrow is contacted 
with a plurality of said allele specific inhibitors. 
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147. A method for separating a first cell from a mixture of cells, comprising the 
steps of: 

a) providing an allele specific binding compound which binds to at least 
one but less than all alleles of a gene, wherein a said allele of said gene expressed 

5 in said first cell is not expressed in other cells of said mixure of cells or is expressed 

in other cells in said mixture of cells and not in said first cell; 

b) contacting said mixture of cells with said binding compound under 
conditions such that said binding compound binds to said allele and not to non-target 
alleles; and 

10 c) separating bound cells from unbound cells. 

148. The method of claim 147, wherein said mixture of cells comprises normal 
somatic cells and cancer cells from a patient, said first cells are said normal somatic 
cells, and said first cells express a said allele deleted in said cancer cells due to LOH 

15 of said gene, comprising 

separating said normal somatic cells from said cancer cells. 

149. The method of claim 147, wherein said allele specific binding compound is 
an antibody or antibody fragment. 

20 

150. The method of claim 147, wherein said binding compound is attached to a 
solid support. 
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SSCP Overview 



PCR amplify gene from 36 cell lines: 

1. Purify RNA from 36 cell lines. 

2. Use reverse transcriptase to make 

cDNA from ail ceil lines. 

3. Do PCR with radioactive nucleotides or 

kinased primers to amplify and label 
overlapping .2-1.5 kb fragments 
spanning target gene cDNA. 



Set up three different restriction digests to 
cut the radiolabelled PCR products from all 
36 cONAs into 100-400 base pair fragments. 



£ 



Digest #1 



Digest #2 



Digest #3 



Run the three sets of 36 
digests on non-denaturing 
35 cm acrylamide gets, 108 
wells per gel, two gel types. 



£ 



£ 



Gel #1: 
5,5% Acrylamide, 
0.5X TBE buffer 




Gel #2: 
8% Acrylamide, 
10 % Glycerol, 
1X TTE buffer 









E 



Dry gels, expose to X-ray film and 
inspect autoradiographs to identify 
variant DNA bands. 



I 



Select variant cDNA samples and 
use DNA sequencing to determine 
sequence of va riant alleles. 



Use agarose gels, SSCP or DNA 
sequencing to determine genotype 
of additional individuals in order to 
better determine allele frequencies 

in different population groups. 
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16 
13 

"21 



Ovary CR 53:2393 * 

0.15 Qvary^ ^^^ ^ ^ B ^J C ^6!9: 4 2^ ^ 

0.12 0var >^ BJC 69:429 



0 0 Prostate G 11:530 




WO 98/41648 



46 / 214 



PCT/US98/05419 



Chromosome 4 - p Arm 

SUM 729 93 0.13 



WO 98/41648 



47 / 214 



PCT/US98/05419 



Chromosome 4 - q Arm 



Band 



Mark: or 



Total 



Cases w/LOH 



LOH Froq. 



33-35 



D4S171 



29 



15 



0.52 



Tumor Type 



Reference 



Unknown 



Unknown 



20 



tt?^»:r<g?yWCT3*»?^ 



0.1 Brain 



Bladder CR 55 :521 3 



CR 



3reast 




50: 5784 
BCRT 32^5 



53:4356 



21-23 



ADH3 



22 



Unknown 



D4S163 



41 




CR 54:4481 



Unknown D4S415 26 8 0.31 Cervix CR 56:197 



Unknown D4S415 



Endocrine CR 56:599 



Unknown D4S163 35 



Unknown 



D4S402 



20 



1 



0.2 6 Es ophag e al GCC 10: 17 "7^ 

0.05 Head&Neck CR 54:4756 



Unknown D4S163 23 2 

£m£ 

Unknown 



0.09 Ki „ dne V , CR 51:820 

Otis- .l^NKianevI ^siAS' 
Kidne 




D4S426-D4S415 5 0 



w ^ PNAS 92:2854 



Unknown 



21-23 

Unknown D4S16 5 2 

pll-q21 MT2P1 16 



Liver 
Liver 



Unknown 8 0 0 

ADH3 6 1 0 . 17 

0 . 4 Liver 

_ _ 0.5 Liver 

pll-q21 MT2P1 19 4 0.21 L iver CR 5 4 : 281 



_ BJC 64 : 1083 
CR 51:4367 
JJCR 81: 108 
JJCR 81:108 



ADH3 



18 



2>23 ^ ^ 



0.06 



Ovary 



IJC 54:546 



11.0-15 D4S1607 20 3 0.15 Ovary BJC 69:429 

^ 33-35 D4S17J ^ 12 ^ 4 0 .33 Ovary ^ BJC J69: 429 

Unknown D4S27 29 10 0.34 Ovary CR 53:2393 

35 Unknown 6 1 0.17 Pancreas CR 54:2761 

Unknown D4S163 17 3 0.18 Sarcoma CR 52:2419 



WO 98/41648 PCT/US98/05419 

48 / 214 



Chromosome 4 - q Arm 



WO 98/41648 



49 / 214 



PCT/US98/05419 



Chromosome 5 - p Arm 



Band Marker Total Cases w/LOH LOH Preq 

Unknown D5S392 19 0 0 



Unknown D5S392 
Unknown D5S392 19 
Unknown D5S13 21 




CR 56:599 



Kidney 



pter-olS 



^pter-pl5 



D5S4 
D5S4 



11 0 



0.1 



Breast 



PNAS 92:2854 
CR 53:4356 
GCC 2:191 



Dter -pl5 D5S4 19 4 

pter-pl5 D5S4 1 0 




15.1-15.2 D5S406 25 12 0.48 Cervix 



CCG 52:72 



MM 



JNCI 87:742 



0.38 Cervix CR 54:4481 



„L 5 i;lld.J?Ail2 ]1 0 0 Prostate G 11:530 

15,1-15.3 D5S208 20 10 0.5 Cervix JNCI 87*742 

15-21 £5S630 13 3 0.23 Lung 0 12-97 

JJ^}.Z}±:2 _JS 8 0. 32 Cervix JNCI 87: 742 

1 0.05 Uterus CR 54:4294 

Unknown D5S419 26 3 0.12 Cervix CR 56:197 



15.1-15.3 D5S117 22 



Unknown D5S419 16 3 0.19 Head&Neck CR 54:4756 



0.21 



Cervix 
Cervix 



Unknown D5S395 28 

Oil? 

11.0-13 D5S21 9 5 0.56 

Unknown Unknown 4 0 0 Brain 

Unknown Unknown 5 0 0 Colon 
Unknown 



CR 56 : 197 
CR 54 : 44 81 




Unknown Unknown 



BJC 65:809 



WO 98/41648 



50 / 214 



PCT/US98/05419 



Chromosome 5 - p Arm 



SUM 



722 



135 



0.19 



WO 98/41648 



51 / 214 



PCTAJS98/05419 



Chromosome 5 - q Arm 



Band Marker Total 



15-21 D5S491 8 

11.2-13.3 _ D5S6 30 
U. 2-13. 3 D5S6~" *"~ >v ""' 32 



/LOH LOH Freq, Tumor Typo Reference 



15-21 



D5S637 5 



19 



15-21 D5S626 

Unknown D5S107 

Unk nown D5S107 30 

Unknow n D5S37 2 

Unknown D5S37 28 

Unknown D5S37 12 



15-21 D5S644 9 



0.3B 
0.03 



B Lung 



0 12 :97 
GE 5:554 



Breast 

0.28 Colon CR 50:7166 



0.2 Lung 0 12:97 

Lung 



R2> 



Uterus CR 54:42 94 



0 9:991 



Colon 

Esophageal CR 54:2996 



llTJlllllMm M ulO^ 



il l I i N il TOnil TTTn T l nn in M ii l iMTfannM^ ~— 



Sarcc 



CR 52:2419 



0.33 Luno 0 12:9"? 



14-21 



D5S71 



10 



0.1 



Colon 



S 241:961 



14-21 
14-21 



em 



14-21 




D5S71 
D5S71 
D5S71 



21 



D5S71 



( 0^38 < 
0.05 

o 



Colon ^C C 3:468 

Ovarv IJC 54 : 546 



Stomach 



Uterus 
Stomach 



GCC 3M6B 
CR 51:5632 
CR 56:612 



Unknown D5S409 17 6 0.35 oiumatu 

14-21 D5S82 15 4 0.27 Colon JJCR 82:10 

21 D5S421 25 5 0.2 Bladder CR 55:5213 

21 D5S421 5 0 0 Kidney GCC 12:76 

Unknown D5S61 31 19 0.61 Colon CR 50:7166 

21-22 D5S81 13 4 0.22 Colon JJCR 82:10 

21-22 D5S81 13 3 0.23 Kidney CR 51:820 



WO 98/41648 



52 / 214 



PCT/US98/054I9 



Chromosome 5 - q Arm 



21-22 D5S81 



Unknown 



D5S81 



4 

12 



wmmmmmrnmmmmmmm 



J21 
21 



0 Liver SJC 67:1QQ 

0.4 2 Stomach HG 92: 244 



Unknown L5.71 

Unknown L5.71 



MCC 4 
MCC 34 



MBMiMl'lfl i W iiiiii M m i ■■ ■ ill 




0.25 
0.35 



C olon 0 9:991 



1 Lung 

1 Lung 



Colon EJC 30A:66 



CR 52:2478 



MmmmmmmmBmmmrmmmmm 



1 Lunc 



Lung 



: ""known MCC 4 0 0 Lun 

_ 21 MCC 36 4 



CR 52:2478 

H 

CR 52:2478 

mm: 



CR 52:2479 



n 0 -1 1 Sto mach 




,,., w ^£l ww ., MCC-APC 25 7 0.28 Breast BJC 68:64 

JLi^ MCC-APC ^ 45 16 Q 36 Colon GAST 104:1 

21 MCC-APC 26 



21 



20 0.77 Esophageal PNAS 89:33 



...,„„ m ™ m ^™^2££^*F9 _~ 5 2 °- 4 Lung CR 52:1995 

21 ,„._ „A?C 21 7 0.33 Colon CR 52 : 741 



_HTJ5£2!£!L ,.,....,„„. Ap C 33 6 O.IS Colon EJC 30A:66 

, „ , A , pc 36 24 0.67 EsoDhaaeal CR 52:6525 

^^Jl * p Jr 20 14 0.7 Luna 0 12:97 

,^JIL, 7 5 0.71 Lung CR 54:1772 

Unknown APC is 9 0 .j 0vary G0 55 : 24 5 



21 APC 



Unknown 



APC 



Ovary 

0.43 Prostate BJ ^JJ^ :3S0 

0.09 Stomach CL 96:169 



WO 98/41648 



53 / 214 



PCT/US98/05419 



Chromosome 5 - q Arm 



^ 21 ^ _ ^ ^ ^ A PC ^ 14 ^12 ^ 0 . 86 Stomach ^ JJCR 8^10 

21-22 D5S346 46 1 0.02 Kicinev BJC 69:230 

21-22 D5S346 18 2 0.11 Stomach CR 56:612 

Unknown Unknown 19 3 0.16 Colon JJCR 82:10 

21-22 ^ ^ D5S84 11 2 0. 18 , Sreast 

21-22^ ^D5S8^ 3 ^ ^ ^1 0.33 ^ Cervix^ ^ ^ GCC ^ 9: 11 9 ^ 

21-22 D5S84" 5 2 0.4 Kidney CR 51:5817 

21-22 D5S84 9 4 0 . 44 Liver CR 51:8 9 

D5 S 8 4 13 1 0.08 Uterus GCC 9 : 119 

21-22 D5S86 4 1 0.25 Pancreas GCC 3:468 

m ^GO 55:24 5 



21-22 



31-33 



D5S804 



19 



0 . 3 2 Ovary^ 



21-22 FBN2 15 4 0. 27 Stomacn CR 56:612 

33-35 D5S70 3 0 0 Colon GCC 3:463 



33-35 D5S70 13 5 



0.38 



ch GCC 3:468 



21-22 



D5S178 



15 




0^4 
0 



31-32 GRL 8 0 0 Ovary CR 50:2 724 

21-22 D5S210 19 5 0.26 Stomach CR 56: 612 



21-22 



;I55 

34-qter 



D5S209 
D5S2 



23 2 0.09 Stomacn CR 56: 612 

3 l 0.33 Cervix CR 49- 3598 



34-oter 



SIS 



Unknown 



D5S2 
D5S2 
JD5S2 i 
D5S400 




34-qter D5S2 2 0 0 



Unknown D5S400 32 5 0.16 



^ltomacT''"^l:R 4~8:29B 8 
Cervix CR 56:197 



WO 98/41648 



54 / 214 



PCT/US98/05419 



Chromosome 5 - q Arm 



Unknown D5S429 
Jtfjqter D5S4 3 



19 



1 



0.05 



Kidnev _ PNA5 _ 9 r : . 2 . 8 

5 2 0.4 Colon BJC 67:100 

35-ater D5S43 10 0 0 Endocrine N 328:524 

35-oter D5S43 10 5 0.5 Liver BJC 64:108 

35-oi:er D5S43 11 0 0 Pancreas BJC 65 :809 

35-ater D5S43 34 8 0. 2 4 Stomach CR 51< 2926 

35-ater D5S43 25 5 0.2 Testis GCC 13:249 

15-21 Unknown 6 0 0 Cervix BJC 67:71 



Unknown Unknown 2 1 0.5 C ervix BJC 67:71 

Unknown Unknown 23 B 0.35 Colon JJCR 82:10 

mm, 



Unknown 



Unknown 



19 



0.37 



Colon 



JJCR 82:10 



Unknown Unknown 17 1 0.06 Colon JJCR 82:10 



Unknown Unknown 

15-21 Unknown 

.^^ r i;^. r ,. clip n 

Unknown CRI-L4 5 

21 D5S141 

L.I33S5 

21-22 LS5.34 



Unknown D5S410 

21 D5S133 

___2} D5S141 

Unknown Unknown 



L 

0.33 



Colon JJCR 82 :10 

Colon BJC 67: 100 




26 



15-2: 



10 

Unknown 5 



0. 67 

0.03 
0.17 
0.31 



Colon 
Colon 



2 0.1 Colon S 241:961 

3JC 67:100 
CR 50:7166 
~Cff 54 : 4756 
CR 51:5817 
CR 51:5817 



Head&Neck 



Kidnev 



Kidney 

0 . 1 Liver CR 51:89 

Liver BJC 67:100 



WO 98/41648 



55 / 214 



PCT/US98/05419 



Chromosome 5 - q Arm 



21-21-34-qter D5S43-D5S81 45 14 0.31 Liver JJCR 84:89 

" V ^ Unkn °^ ll ' ^ J^t"""^"" " N ' 0 „ Lung ^ ^PN 84 : 9252 

13-12 del-27 8 3 0.38 Lunq ^ "*~*'o* 12:97* 



21 D5S122 U 5 0.45 Ovarv GO 55:245 

2 1 " 2 2 .^"-L.^,. 15 ,.-^ 6 L mm Q -, 4 ,. ° var y ,,, BJC ..1 9: . 42 9 

D5S98 13 3 0.23 Stomach HG 92:24 4 

_____ 

D5S98 



15-21 

Hi 

15-21 
SUM 



oiHniiiii 



2866 



763 



0. 14 Testis GCC 13:249 

0.27 



WO 98/41648 



56 / 214 



PCT/US98/05419 



Chromosome 6 - p Arm 



Band 


Marker 


Total 


Cases w/LOH 


LOH Froq. 


Tumor Type 


Reference 
















24-25 


F13A1 


18 


5 


0.2B 


Ovary 


GO 55:245 












Unknown 


D6S309 


18 


1 


0.06 


Kidney 


PNAS 92:2854 


.arikjaowir,:,;J-^';^ : ': : 












pter-p25 


D6F21S1 


12 


4 


0.33 


Ovarv 


3JC 67:551 
















Unknown 


D6S289 


36 


13 


0.36 


Colon 


CR 56:145 
















21.3-24 


D6S109 


17 


3 


0.18 


Ovary 


BJC 69:429 
















Unknown 


D6S276 


20 


10 


0.5 


Cervix 


CR 56:197 














Unknown 


D6S299 


20 


0 


0 


Head&Neck 


CR 54 :4756 
















Unknown 


D6S105 


27 


2 


0.07 


Esophageal 


IJC 69:1 
















Unknown 


D6S105 


26 


2 


0.08 


Uterus 


CR 54 : 4294 
















Unknown 


D6S10 


35 


4 


0.11 


Breast 


GCC 2:191 












r?WiT^^.i^,Sl!(w,r,w.T 




Unknown 


D6S10 


2 


0 


0 


Pancreas 








«H 








•; ; Gm:::S3q:vfr^:.- 


Unknown 


D6S10 


32 


4 


0.12 


Testis 


0 9:2245 














21.3 


HLA-DOA 


18 


4 


0.22 


Ovary 


BJC 67:551 



21.3 HLA-DQA 1 0 0 T estis CCG 52:72 ^ 

Unknown^ TtiFa 3 3 14 0.4 2 Colon CR 56:14j5^ 

Unknown D6S291 12 1 0.08 Brain CR 55:4696 

Unknown D6S29 22 3 0.14 Kidn ey CR 51^5£17^ 

Unknown D6S29 12 6 0.5 Ovary CR 51:5118 

Unknown D6S29 9 0 0 O vary CR 50:2724 

Unknown D6S271 44 17 0.39 Colon CR 56:145 

Unknown D6S282 22 0 0 Endocrine CR 56:599 

12.0-11 ^ KRAS PI 2 Q 0 Uteru s _^ CR 51:5632 

Unknown D6S257 42 13 0.31 Colon CR 56:145 



WO 98/41648 
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Chromosome 6 - p Arm 



Unk nown 
Unknown 



Unknown 14 



0.07 Brain CR 50:5783 



D6S40 



28 



Unknown D6S344 22 0 



0. 18 



Breast 



CR 50:7184 



Enaocrme CR 56:599 



;Unkinowrv.: 



MM. 



Unknown 
Unkno wn 

21.3 

Unknown 



D6S40 



23 



0.3 



D6S265 



19 



0.42 



CR 54 : 2996 



:: -Esophageal} 
Esopnageal 



Head&Neck 



CR 54:1152 



D6S138 



34 



0. 18 



Kidney 



D6S4-C2-D6S1 



19 



Unknown 20 



5^ 



CR 51:5817 



0.26 



Kidney 



CR 49:5087 




CCG^52jJ2^_ 



WO 98/41648 



58 / 214 



PCT/US98/05419 



Chromosome 6 - q Arm 



Total 



Cases w/LOH 



LOH 

















Unknown 


D6Z1 


22 


0 


0 


Stomach 


GCC 14:28 
















13 


D6S254 


5 


0 


0 


Breast 


BJC 73:144 












Breast 


BJC 71:290 


14-15 


D6S284 


26 


5 


0.19 














16.3-21 


D6S286 


27 


8 


0.3 


Breast 


BJC 71:290 
















16.3-21 


D6S286 


17 


1 


0.06 


Endocrine 


CR 56:599 
















Unknown 


EDDR1 


14 


4 


0.29 


Ovary 


GCC 15:223 
















22.3-23.1 


D6S270 


22 




0.32 


Ovary 


GCC 15:223 
















Unknown 


D6S310 


33 


10 


0.3 


Ovary 


GCC 15:223 
















Unknown 


D6S311 


6 




0. 67 


Endocrine 


CR 56:599 
















Unknown 


D6S194 


4 


0 


0 


Ovary 


CR 52:5815 


Unknown 


D6S194 


16 


4 


0.25 


Ovary 


CR 52:5815 












••^•:y':Kidnay :: i;r;:'L 




Unknown 


D6S142 


6 


0 


0 


Ovary 


CR 52:5815 



Unknown 



D6S142 



CR 52:5B15 



D6S161 11 0 

D6S161 



Unknown 
Unknown 

Unknown ^ D6S251 36 13 

Unknown D6S251 28 

13 

E^W 55 ? 



0 



Ovar y CR 52:5815 

Ovary CR 52:5815 

:-:Braas£Mff ^ ' ? 

Colon CR 56:145 



0 Ovary CR 5 5 : 21 69 



Ov ary 
^:0^33^:^BoBreS&V 



D6S239 



10 



14-16.2 D6S252 



0.3 



Ovary 



CR 55:2169 



48 



ii 



0.23 



14 



D6S300 



32 



Breast 



BJC 73:144 



0.34 



Breast 



BJC 71:290 



..^^jj-. 3 : iUL ? 6S2 1 6 .. ., 27 _ w 9 0.33 B reast BJC 7 1:290 

.^JL k _ n ° wn , tt , D6S2 „ 46 _ _9 2 0.22 Ovarv CR 55:216 9 

16.3-21 D6S283 30 5 



0.17 



Breast BJC 71:290 



WO 98/41648 
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Chromosome 6 - q Arm 



Unknown 



16.3-21 



D6S26B 



4 



0.25 



D6S302 



Kidnev 



: WW-V-WfiiAi"-' " ' Wi 1 .m. WW!.' . ... 



GCC 12 




30 



13 



0.43 



Breast 



BJC 73:14 4 



2 1-23 



D6S261 



25 



0.2 



Breast BJC 73:144 



21-23 



D6S287 



*!7 r T JLMIi:W I . I J.ftM.M. ll ... 



22 



0.18 



Endocrine 



CR 56:599 



Ems 



22.3-23.1 



ARG 



12 



0.17 



Breast 



BJC 73: 144 



22.3-23 .1 



D6S262 



mm 



mmmm 



::;^&632&Z^;i- 



28 



10 



mi 



0.36 Breast 



BJC 73:14 4 



Unknown 



D6S262 17 



0.06 



Head&Neck 



CR 54 :4756 



Unknown 



D6S32 IB 



0.5 



Stomach 



GCC 14:28 



23. 1 
23.1 



D6S87 




_22-23^ 
22-23 



22-23 



D6S87 

^^^^ 

MJTB^ ^ 11 

S£bT ' 

MYB 

MYB 7 



0. 18 



Colon 



N 331:273 



■"20-. 



13 



0 * 



Liver 



JJCR 81:108 



0.43 



Melanoma 



CR 51:5449 



^ r iKeU"rbblas ton* • 9 i XQ9 5 U 



0.67 



Ovary 




BJC w 67:551 



CR 5 0:2724 
CR 52:2419 



CR 48:2988 



0,14 Uterus CR 51:5632 



Unknown 
Unknown 



D6S250 



24 ii: 
10 



D6S136 3 



3 



0.3 



Ovarv 



CR 55:2169 



Ovary 



CR 52:5815 



Unknown 



D6S441 



11 



0.09 



Endocrine CR 56:599 



24-27 ESR 



16 



0 



Cervix 



CGC 79:74 



24-27 



ESR 



0.5 



Melanoma 



CR 51:5449 
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24-27 



ESR 



0. 17 



24-27 



Ovary 



ESR 14 5 °* 64 Ovary CR 50:2724 



CR 55:2169 



0-6 7 Ovary BJC 67:551 



24-27 



15 



10 



24-27 



ESR 



1 



IP} 



Pancreas 



GCC 3:468 



24-27 



ESR 



16 



Unknown 



D6S41S 



Stomach C R 5 1:2926 




0.41 



0.09 



Ovar y r _ c L cc: ., 15 '?jj_. 



HeadSNeck 



CR 54 : 1152 



25.2 



Unknown 



D6S255 11 2 



0.1B 




D6S305 



40 



16 



Unknown 



0.4 Colon 



CR 56:145 



D6S305 



29 



Unknown IGF2R 16 11 

Unknown IGF2R 4 l 



Melanoma CR 56:589 



0.69 



Liver 



O 10:1725 



0.25 



Ovarv 



Unknown IGF2R i: 



0.^27 ^^EV 
0.11 Stomach 



CR 55:2169 
CR 55:2169 




Ovary 



CR 52:5815 



Unknown 



SOD2 



8 



Unknown D6S264 32 



13 



0.5 



Ovary 



BJC 67:551 



CR 56:145 



WO 98/41648 
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Chromosome 6 - q Arm 



: Unknown,;;: 
Unknown 



D6S264 15 5 



: .^;^;;:.y-v : ;:v:.EQdacriiig 



0.33 



Head&Neck 



Unknown 

21-o ter D6S 2 8 3 



CR 54:1152 



„ D6S2 64 34 12 0.35 Ovarv GCC 15:223 

0.38 Colon GCC 3:466 



21-qter D6S2 5 3 0.6 Ovar 

21-qter D6S2 1 l i Pancreas 



0 5:219 
GCC 3:468 



Unknown 
Unknown 



mm: 



D6S133 



22 




14 



0.64 



Ovary 



BJC 67:551 



mm 



38 



SB 



23 



0.61 



Unknown 



D6S297 



Ovary 



£2% 



GCC 15:223 



27 



mm 



14 



0.52 



Ovary 



GCC 15:223 



0-07 Brea st C R 53:4356 

0.14 Ovary IJC 54:54 6 




Unknown 
Unknown 



Unknown 



BJC 67:551 



D6S37 



D6S37 



14 



Unknown D6S37 29 

27 D6S446 24 



mm 

2 



ii 



21 
21 



27 



D6S281 



D6S281 



39 



13 



0.21 
0.07 

0.19 
0.33 



CR 52:2419 



Sarcoma 



O 9:2245 



Testis 
Ovary GCC 15:223 



Br east 



Ovary 



BJC 71:290 
GCC 15:223 



WO 98/41648 
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Chromosome 6 - q Arm 





■ :'■ ; ^£ 1; &tf a ksd wti : : ^ ^ 


,p2f 








27 


D6S193 


29 


8 


0.28 


Breast BJC 71:290 

li JPHI... ii ij n, ,n U . 


14-15 


D6S330 


12 


6 


0.5 


Breast BJC 71:290 












21-23.3 


D6S357 


20 


2 


°- 1 Breast BJC 71:290 


14-16 


D6S39 


1 






nreaSw CR 53:3804 














25.1 


ER 


47 


9 


0.19 
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22 D7S21 19 1 0.05 Stomach HG 9?-?44 

Unknown D7S517 6 0 0 Kidney PNAS 92-2854 
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0.07 Liver JJCR 81:108 



21.3-22.1 



C0LIA2 



Neuroblastom CR 49:1095 



21.3-22_. 1 C0LIA2 6 0 
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39 9 0.23 Liver GCC 10*7 
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CR 52:2478 



J?!!?^ ™ 1 1 1 Lunq CR 52:2478 

..-^^l^^^^^Sl?.?^. _ 7 0 0 Prostate GCC 3:215 
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12-13.1 D19S75 19 3_ 0.16 Brain "cR 54 :1397 



JSte.^JiSSl™ 6 2 0.33 Brain AJP 145:1175 

~»J&d«» M »-J&S3g 19. 3 0.16 Breast CP. 50:7184 

Unknown D19S223 24 3 o7l2 Cervix CR 56™ 197 



...^J 3 ^ 2 ^_^_„_ AP °C2 17 1 Q.06 Colon 

Unknown D19S210 18 *""*T* oToT"* EnoTcrine CR 56:599 



CCG 48:167 



Unknown 



Unknown 



D19S210 22 

mmm 



D19S255 



10 



0.32 Head&Neck 
0 Head&Neck 



CR 54:1152 



CR 54:4756 
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Chromosome 19 - q Ann 



: ■■.Unknown::;!^ 












Unknown 


D19S210-D19S224 


19 


0 




Kidnev 


PNAS 92:2854 








' ~ ' I'^'IH' ; h i « ■ ^ 








Unknown 


D19S225 


3 


0 


0 


Kidney 


PNAS 92:2854 




*Mk£z£ #££$22 












13.4 


D19S22 


24 


11 


0.46 


Lung 


CR 52:247B 














13.4 


D19S22 


1 


1 


l 


Lung 


CR 52:2476 














Unknown 


D19S225 


22 


0 


0 


Melanoma 


CR 56:589 
















Unknown 


CYP1 


7 


1 


0.14 


Ovary 


CR 50:2724 














12-13. 1 


D19S49 




3 


0.23 


Ovary 


BJC 69:429 














Unknown 


D19S8-CYP2A 


23 


4 


0.17 


Ovary 


CR 53:2393 










■ ;.v;i;;;^;l^ : ;- : 


^ £ rostrated 




13.4 


D19S22 


9 


3 


0.33 


Sarcoma 


CR 52:2419 


12 


D19S7 


19 


2 


0.11 


Testis 


O 9:2245 




'Fit, : .>■ "i|m*QG2 k "'i v''.-!'^- ; 












SUM 




1066 


323 


0.3 
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Chromosome 20 - p Arm 



Band Marker 


Total 


Caaaa with LOB LOR Fraquancy 


Tumor Typa Rafaranca 










Unknown Unknown 


12 


1 0.08 




Brain CR 50:5784 










Unknown D20S19 


6 


0 0 




Breast CR 53:3804 






12 D20S6 


20 






Breast GCC 2:191 












Unknown D20S19 




0 0 




Cervix GCC 9:119 












12 D20S6 


26 






Cervix CR 54 : 4481 












Unknown D20S95 


16 


0 0 




Endocrine CR 56:599 








Unknown D20S72 


20 


2 0.1 




Esophageal CR 54:2996 






Unknown D20S104 




2 0.09 




Head&Neck CR 54 : 4756 








m 




Unknown D20S104 


17 


1 0.06 




Kidney PNAS 92:2854 












Unknown D20S117 


5 


0 0 




Kidney PNAS 92:285 4 












Unknown D20S19 


29 


1 0.03 




Kidney CR 51:820 












Unknown D20S19 


40 


6 0.2 




Lung CR 52:2478 










12 D20S6 


2 


0 0 




Neuroblastom CR 49:1095 
a 


Unknown D20S19 


32 


"4 0.12 




Ovary CR 51:5118 












12 D20S6 


27 


4 0.15 




Ovary IJC 54 : 546 












12 D20S5 


2 


0 0 




Pancreas CR 54 : 2761 












Unknown D20S19 


8 


2 0.25 




Sarcoma CR 52:2419 












Unknown D20S19 


| 15 


3 0.2 




Stomach CR 52:3099 


Unknown D20S19 2 


i^V^ " ! vl ! !<■ : "Sfr ^ ' 

0 1 """"" "' — — 




Uterus GCC 9:119 




MI 








12 D20S6 


4 


1 0.25 




Uterus CR 51:5632 
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Chromosome 20 - q Arm 



Band Marker Total 


Cases w/LOH 


Ijuh rroq . 


Tumor Type 


Reference 














Unknown Unknown 


20 






0 


Brain 


^^•^ < ^V,|tV:<•^, : ,3,4< 4Z?4//i<^-- :] 

CR 50:578 4 




■viT ilHi-T 










Unknown D20S119 


26 




3 


0 12 


Cervix 


CR 56 : 197 














£R&54>v 4 4 81 1 


Unknown D20S25 


25 






o 


Endocrine 


CR 5 6: 599 














Unknown D20S100 


18 




1 


0.06 


Head&Neck 


CR 54:4756 














&fr,CRs54 v4.7Si6:J^;j 
CR 54:1152 


Unknown D20S110 


16 




l 


0.06 


Head&Neck 














PNAS 92:2854 


Unknown D20S100 






0 




Kidnev 




WW* 








BJC 64:1083 


Unknown Unknown 


5 




i 


0.2 


Liver 
















13.2 D20S4 


4 






0 


Liver 


CCG 48:72 








Sii 








13.2 D20S4 


10 




4 


0.4 


Lung 


PN 86:5099 
















13.2 D20S4 


6 




2 


0.33 


Luna 


PN 86:5099 


Unknown D20S19 


33 




0 


0 


Ovary 


IJC 54 :546 












: ; ;i;.iOvary^;' ;-;- ; : V 


BJC 69:429 


Unknown D20S4 6 


14 




3 


0.21 


Ovary 
















13.2 D20S4 


e 




0 


0 


Prostate 


G 11:530 






'^V*. !T v? 






J^Stoaach'^/ - . ? 




Unknown D20S19 


31 




0 


0 


Testis 


0 9:2245 
















13.2 D20S4 


36 




4 


0.11 


Testis 


0 9:2245 


SUM 


509 




38 


0.07 




:v;CR'o54^4234.-rU:i 
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Chromosome 21 - q Arm 

SUM 692 90 0.13 
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Chromosome 22 - q Arm 



Unknown 



Unknown 



Mar , ker Total Cases w/LOH LOH Froq. Tumor Tvtwj 

BCR 2 0 0 CGC 53:271 



0 0 Erain CR 50:6783 



11.2-12 



322§J 

11.1-11.2 D22S10 
Unknown D22S15 6 

iaisi 



D22S1 



11.2 



U nknow n 
Unknown 



D22S20 
D22S24 



0.2 Rrain C GC 53:271 



iiiMiMiiil«lii»iig 



Bra ir. CR 53:2386 ' 

Brain CGC 66:1 17 



D22 S25B 

Unknown D22S29 3 



CR 50:6783 



JL 0.06 Br ain CR 54:1397 

2 0.67 3rain CR 50:6783 



mm 



^jS&SSLmm-i ° 22S32 ,„ , ,*< 1 -0 -0^ B rain 0 ^ 4 9: 6572 

D22s80 -1 0 0 Brain CGC 66:117 " 



Unknown D22S9 1 0 0 Brain CCC fifi-Tl7 

Unknown IGLV i n n D rair m c,n c-tq-s 

: _ ;OT iJ^^ 5 0 0 Brain CGC 66:117 

l^.J-13.1 PDGFB 11 1 Rrair rrr 

Unknown D22S10_ 16 4 o 25 Ereas*- GCC ^'191 

Unknown D22S9 24 __4 ^_ _ 0.17 B reast G CC 2:191 



i^.^- 2 .....^JB 22 ^- ,.. 2 Z .... 2 °- 07 Cervix CR 54:4481 

-JJJSiSSSL*-^^ 3 0.15 Cervix CR 56:197 

ll :}- 12 , D 2 2S1 11 1 0.09 Colon 

11.1-11.2 D22S10 12 0 o Colon 




mm 

Unknown D22S10 29 ii 0 . 38 Colon CR 50:7166 
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Chr omosome Arm LOH Freq. 
1 q 0.15 







0.22 




IMA 
















6 


q 


0.25 


















8 


q 


0.14 








9 


q 


0.47 








10 












11 


q 


0.26 








12 


q 










14 


P 


0.08 








15 


p 




.i..^a>rf^fTi.i.ft..rr : ¥^ii^;.'«. 






16 




0.17 








17 


p 


0 .44 






18 


p 








iiliiiii 


19 


P 


0,13 








20 


P 










21 


q 


0.13 
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1) Cyclins 



5" 



Validation: Deletion of CDC23 (Anaphase Promoting), a S. cerevisiae gene in the same biochemical 
family, is lethal. 



Name 



Variances 
Identified 



Chromosome Genbank 



Sequence 


1 


3p21 


U54 831 


1 


5q31 


M34065 


3 


lpl5.3-plS.l 


X62048 


2 


13 


U18291 


4 


Uql3 


M73554 


2 


6p21 


M90814 


1 


4 


XB7843 


2 


12 


U792S9 


1 


22 


D38076 


1 


10 


X56466 



9 CDC-25A 

10 CDC-25C 

524 Weel 

1043 CDC16HS 

1278 Cyclin Dl 

1280 Cyclin D3 

1298 Cyclin H Assembly Factor 

1445 Cycl in-Dependent Protein Kinase 

1450 RAN binding protein 1 

1523 14-3-3 PROTEIN TAU 



1) Cyclin dependent kinases/phosphatases 

Validation: Deletion of CDC28 (Cyclin Dependent Protein Kinase), a S. cerevisiae gene in the same 
biochemical family, is lethal. 



Variances Chromosome Genbank 
Identified Sequence 



1051 CDC28 protein kinase 1 2 17 XS4941 

1052 CDC28 protein kinase 2 1 9 X54 942 
1111 Protein phosphatase 1, catalytic 4 n M63960 

subunit, alpha isoform 

1388 M-PHASE INDUCER PHOSPHATASE 2 1 20 M81934 

1401 M-phase phosphoprotein, mpp6 5 7 X98263 

1) Cell Division Structural Proteins 

Validation: Deletion of CBF2 (Kinetochore Protein), a S. cerevisiae gene in the same biochemical family, 
is lethal. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

20 MCM7 (Mini chromosome Maintainence 3 7q21.3-q22.1 U20980 

1246 Chromatin assembly factor- I p60 subunit 2 21 U20980 

1273 Chromosome segregation gene homolog CAS 1 20 U3 3 2 86 

1347 High-mobility group (nonhistone 5 13ql2 D63S74 

chromosomal) protein 1 

1487 Chromatin structural protein homolog 3 7 Y12790 
(SUPT5H) 

1607 Centromere protein B (80kD) 1 20pl3 X05299 



2) Uniporters 
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Validation: Deletion of SAT2(Osmotolerance), a S. cerevisiae gene in the same biochemical family, is 
lethal. 

ID Name Variances Chromosome Genbanx 

Identified Sequence 

1253 ATPase, Ca++ transporting, plasma 5 3p26-p25 XS3575 

membrane 2 

1255 ATP synthase, H+ transporting, 4 12pl3-qter X03559 

mitochondrial Fl complex, beta 
polypeptide 

1286 Putative Chloride Channel 1 13ql4 , 3 -q21 . 1 X8337B 

1337 Copper Transport Protein HAH1 1 5 U70660 

1407 Nuclear chloride ion channel protein 4 20 U93205 

(NCC27) 

1463 Sodium channel, voltage-gated, type I, 1 19ql3.l L16242 

beta polypeptide 

1505 Transient receptor potential channel 11 3 XB9066 

1521 Voltage -dependent anion channel 2 4 L06328 



2) Antiporters 

Validation: Proven essential in mammalian cells by tritium suicide selection experiments. 



ID Name 



Variances Chromosome OenbanX 
Identified Sequence 



1471 Solute carrier family 9 

{sodium/hydrogen exchanger) 

1250 ATPase, Na+/K+ transporting, beta 1 
polypeptide 

1251 ATPase, Na+/K+ transporting, beta 2 
polypeptide 

1605 Solute carrier family 4, anion 

exchanger, member 2 (erythrocyte 
membrane protein band 3 -like 1) 



1 Ip36.1-p35 M81768 

1 Iq22-q25 X03747 

2 17p M811B1 
2 7q35-q36 U62531 



3) Acyltransferase 

Validation: Essential for metabolic processes such as biosynthetic reactions and energy metabolism. The 
S. cerevisiae histone acetyltransferase PAT1 and the N-alpha acetyltransferase which 
acetylates the N-termini of proteins are essential for growth. 



Name 



Variances 
Identified 



Chromosome QenbanX 
Sequence 



1227 



1387 



Acetyl -Coenzyme A acyltransferase 
(peroxisomal 3 -oxoacyl -Coenzyme A 
thiolase} 

Lysophosphatidic acid 
acyltransferase-alpha 



3p23-p22 



X12966 



U56417 



3) Amino Acid Biogenesis 
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Validation: Deletion of PR01(Glutamate 5-Kinase), a S. cerevisiae gene in the same biochemical family 
is lethal. 



Variances 
Identified 



Chromosome GenbarOc 
Sequence 



1330 Glutamic-oxaloacetic transaminase 1, 
soluble (aspartate aminotransferase 1} 

1331 Glutamic-oxaloacetic transaminase 2, 
mitochondrial (aspartate 
aminotransferase 2) 

1447 Pyrroline-5-carboxylate synthetase 
(glutamate gamma -serai aldehyde 
synthetase) 



10q24 .l-q25.1 
16q21 

10q24.3 



M37400 
M22632 

X94453 



3) Amino Acid Transport 

Validation: There are ten essential amino acids in man, which must be transported across the plasma 
membrane for use in protein synthesis. 



ID 



Kama 



Variances 
Identified 



Chromosome 



GenbanX 
Sequence 



1581 Solute carrier family 3 (cystine, 
dibasic and neutral amino acid 
transporters, activator of cystine, 
dibasic and neutral amino acid 
transport), member 1 



2plfi . 3 



L11696 



3) Addition, removal, or modification of phosphate groups 

Validation: Deletion of CMD1 (Calmodulin), a S. cerevisiae gene in the same biochemical family, is 
lethal. 

ID Name Variances Chromoaome QenbanX 

Identified Sequence 

1269 Calcineurin A catalytic subunit 

1270 Calcineurin B 
13 51 CALRETICULIN PRECURSOR 
1432 SERINE/THREONINE PROTEIN PHOSPHATASE 

2B CATALYTIC SUBUNIT, BETA ISOFORM 
1476 Snk interacting protein 2-28 



2 8 S46622 

1 10q21-q22 M30773 

1 10q21-q22 M84739 

2 10 M29551 

1 U83236 



3) GDP Dissociation Inhibitors 

Validation: Deletion of GDI1(GDP dissociation Factor), a S. cerevisiae gene in the same biochemical 
family, is lethal. 

ID Mama Variances Chromosome Oenbank 

Identified Sequence 



144 8 RAB GDP DISSOCIATION INHIBITOR ALPHA 



2 



14q23-q24 D13988 
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3) Lactate Transport 

Validation: Genes required to maintain organic compounds at levels required for cell growth or survival. 

ID Name Variances Chromosome Genbank 

Identified Sequence 



1583 Solute carrier family 16 2 Ipl3.2-pl2 

{monocarboxylic acid transporters) , 
member 1 



L31801 



3) Polyamine Biosynthesis 

Validation: Inhibition of polyamine biosynthesis has antiproliferative effects as demonstrated by inhibitors 
of polyamine metabolism. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1587 Ornithine decarboxylase 1 2 2 p25 M16650 



3) Protein Glycosylation 

Validation: Deletion of DPMl(Dolichol-phosphate mannosyltransferase), a S. cerevisiae gene in the same 
biochemical family, is lethal. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1328 Glutamine-fructose-6-phosphate l 2 pl3 M90516 

transaminase 

1339 Heparan Heparan Heparan Heparan 2 io U36601 

N-deacetylase/N-sulf ©transferase- 2 
1434 UDP-GalNAc:polypeptide 3 18 U41514 

N -ace tylgalactosaminyl transferase 



3) Protein Kinase C 

Validation: Deletion of PKC1 (Protein Kinase C), a S. cerevisiae gene in the same biochemical family, is 
lethal. y 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1440 Protein kinase C, beta 1 

1443 Protein kinase C-theta 

1444 Protein kinase C substrate 80K-H 



4 16pll.2 X0G318 

1 10pl5 L01087 

1 7 J03075 
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3) Protein Post-modification 

Validation: Deletion of BET2(Geranylgeranyltransferase), a S. cerevisiae eene in the same biochemical 
family, is lethal. 

ID Name „ . 

variances Chromosome Genbank 

Identified Sequence 

1081 geranylgeranyl transferase type II 2 i Coonn," 

beta-subunit *yaooi 



3) Sugar Biosynthesis and Processing 

Validation: Deletion of PGIl(Glucose-6-phosphate Isomerase), a S. cerevisiae gene in the same 
biochemical family, is lethal. 



ID 



Name 



Variances 
Identified 



Chromosome Genbank 



Sequence 


2 


9ql3 


X92493 


1 


22qll.21-ql3 .3 


U80040 


2 


18 


D14710 


3 


18 


XS0221 


5 


21q22.1-q22.2 


X83218 


5 


11 


AF001437 


5 


7q31-q32 


J03490 


3 


10q22 


M75126 


2 


15q26.1 


XS9433 


1 


2pl6 


X81900 


4 


18pll.31-pll.2 


U53468 


1 


18pll . 31-pll .2 


U65579 


3 


2 


X61100 


9 


3pl3-q23 


M34479 


1 


21q22.3 


M10O36 


3 


1,3 


M36647 


3 


lp22 . 1-qter 


D10245 


10 


5pl5 


D30648 


2 


lq21 


D10326 


6 


7pl4-pl3 


D10S23 


3 


17pll.2-pll.13 


D43682 



14 
1229 
1249 

1257 



1258 



1302 



1303 



1346 
1366 



1395 
1421 



1422 



1424 

1427 
1430 
1451 

1464 

1465 

1576 
1577 
1579 



PIP 5 Kinase beta 
Aconitase 2, mitochondrial 
ATP SYNTHASE ALPHA CHAIN, 
MITOCHONDRIAL PRECURSOR 
ATP synthase, H+ transporting, 
mitochondrial F0 complex, subunit b, 
isoform 1 
ATP synthase, 
mitochondrial 



H+ 
Fl 



transporting, 
complex, O subunit 
(oligomycin sensitivity conferring 
protein) 

Dihydrolipoamide S -acetyl transferase 
(E2 component of pyruvate 
dehydrogenase complex) 
Dihydrolipoamide dehydrogenase (E3 
component of pyruvate dehydrogenase 
complex, 2-oxo-glutarate complex, 
branched chain keto acid dehydrogenase 
complex) 
Hexokinase 1 

Isocitrate dehydrogenase 2 (NADP+) , 

mitochondrial 

NADH dehydrogenase 

NADH: ubiquinone oxidoreductase subunit 
B13 

NADH dehydrogenase -ubiquinone Fe-S 
protein B , 23 kDa subunit precursor 
( NDUFS 8 ) 

NADH - UB I QU I NONE OXIDOREDUCTASE 75 KD 
SUBUNIT PRECURSOR 

Pyruvate dehydrogenase (lipoamide) beta 
Phosphof ructokinase 
UBIQUINOL-CYTOCHROME C REDUCTASE 
COMPLEX 11 KD PROTEIN PRECURSOR 
Succinate dehydrogenase, iron sulphur 
Up) subunit 

Succinate dehydrogenase 2, 

flavoprotein (Fp) subunit 

Pyruvate kinase, liver 

Oxoglutarate dehydrogenase (lipoamide) 

Acyl- Coenzyme A dehydrogenase, very 
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long chain 

1584 Dihydrolipoamide S-succinyltransf erase 
1588 Acyl -Coenzyme A dehydrogenase, C-4 to 

C-12 straight chain 
1590 Pyruvate kinase, muscle 
1596 Phosphoglucomutase 1 
1603 Phosphof ructokinase, muscle 
1611 Enolase 3, (beta, muscle) 



3) Sugar Transport 

Validation: Genes required to maintain organic compounds at levels required for cell growth or survival. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1594 Solute carrier family 2 (facilitated 

glucose transporter) , member 5 
1598 Solute carrier family 5 

(sodium/glucose cotransporter ) , member 2 



5 14q24.3 L37418 

1 lp31 M1SB27 

4 15q22 M23725 

5 lp3l M83088 
4 12ql3.3 U24183 
1 17pter-pl2 X16504 



lp31 M55531 
16 M95549 



4) Protein Degradation 

Validation: Deletion of CDC48(Ubiquitin proteolysis), a S. cerevisiae gene in the same biochemical 
family, is lethal. 



ID 


Na3ne Variances 

Identified 


Chromosome 


Genbank 
Sequence 




1027 


2 6S PROTEASE REGULATORY SUBUNIT 4 


3 


14 


L02426 


1037 


CALPAIN 1, LARGE 


1 


11 


X04366 


1098 


Human mRNA for KIAA0123 gene, partial 


6 


9, 19 


D50913 




cds 






1114 


Proteasome (prosome, macropain) 
subunit, beta type, 6 


7 


9, 19 


D29012 


1115 


Human mRNA for proteasome subunit z, 


4 


9 


D38048 




complete cds 






1116 


PROTEASOME COMPONENT C13 PRECURSOR 


2 


9 


U17496 


1117 


Human mRNA for proteasome subunit 


6 


1 


D26599 




HsC7-I, complete cds 






1118 


Human mRNA for proteasome subunit 


2 


2 


D44466 




pll2, complete cds 






1119 


Human mRNA for proteasome subunit p27, 
complete cds 


1 


2 


AB003177 


1289 


ATP -DEPENDENT CLP PROTEASE PROTEOLYTIC 


2 


19 


Z50853 



SUBUNIT 



4) Protein Folding 

Validation: Deletion of HSPlO(Chaperonin), a S. cerevisiae gene in the same biochemical family, is 
lethal. 

ID Name Variances Chromosome Genbank 

Identified Sequence 
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1287 PEPTIDYL- PROLYL CIS-TRANS ISOMERASE, i " """"" 

MITOCHONDRIAL PRECURSOR M80254 
13 05 DNAJ PROTEIN HOMOLOG 2 

1358 DNAJ PROTEIN HOMOLOG HSJ1 9 ' 2 D13388 

2 9,2 X63368 



4) Ribosomal Subunit 

Validation: Deletion of GRC5(Ribosome), a S. cerevisiae gene in the same biochemical family, is lethal. 



10 



Name 



1127 
1128 
1130 
1131 
1133 
1135 

1136 

1137 
1138 

1139 
1140 

1141 
1142 
1143 
5 

1144 
1145 
1146 
1147 

1148 
1149 
1150 
1151 
1152 
1154 
1155 
1157 
115B 
1159 
1160 
1161 
1223 



Variances 
Identified 



H. sapiens mRNA for ribosomal protein Lll 

Ribosomal protein L17 

60S RIBOSOMAL PROTEIN L18A 

Ribosomal protein L19 

60S RIBOSOMAL PROTEIN L23A 

Human ribosomal protein L27a mRNA, 

complete cds 

Human ribosomal protein L28 mRNA, 

complete cds 

Ribosomal protein L32 

Human ribosomal protein L3 5 mRNA, 

complete cds 

Ribosomal protein L35a 

Human mRNA for ribosomal protein L3 9, 

complete cds 

Ribosomal protein L4 

Ribosomal protein L6 

Ribosomal protein L7 

Ribosomal protein L7A 

Ribosomal protein LB 

Ribosomal protein L9 

Ribosomal protein, large, PI 
Human ribosomal protein S10 mRNA, 

complete cds 
Ribosomal protein Sll 
40S RIBOSOMAL PROTEIN S15 
4 OS RIBOSOMAL PROTEIN S15A 
Ribosomal protein S16 
Ribosomal protein S17 
4 OS RIBOSOMAL PROTEIN S23 
Ribosomal protein S25 
Ribosomal protein S2 8 
40S RIBOSOMAL PROTEIN S2 9 
Ribosomal protein S5 
4 OS RIBOSOMAL PROTEIN S7 
Ribosomal protein S9 
Ribosomal protein L7a 



Chromosome GenbanX 
Sequence 


3 


9,2 


X79234 


2 


17,4 


X52839 


5 


3 


X80822 


1 


17qll 


X63527 


2 


17,18 


U43701 


3 


6, 11 


U14968 


11 


19 


U14969 


4 


20 


X03342 


3 


20 


U12465 


1 


3q29-qter 


X52966 


2 


3q29-qter 


U57846 


4 


3,6 


L20B68 


1 


12 


X69391 


1 


12 


L16558 


1 


19q33-q34 


M36072 


S 


12 


Z28407 


2 


12 


U09953 


5 


15,22 


M17886 




20 


U14972 


1 


19q 


X06G17 


2 


19q 


J02984 


2 


19q 


XB4407 


5 


19 


M60854 


5 


llpter-pl3 


M13932 


2 


5 


D14530 


2 


Ilq23 .3 


M64716 


2 


19 


U58682 


1 


19 


L31610 


2 


19 


U14970 


3 


19 


M77233 


3 


19 


U14971 


6 


9q34 


X52136 



4) T-Complex 

Validation: Deletion of CCT2(T-Complex), a S. cerevisiae gene in the same biochemical family, is lethal. 



ID Name 



Variances Chromosome QenbanX 
Identified Sequence 



WO 98/41648 



166 / 214 



PCT/US98/05419 



14 90 T-COMPLEX PROTEIN 1, EPSILON SUBUNIT 3 
1491 T-COMPLEX PROTEIN 1, GAMMA SUBUNIT 2 



4) Translation Elongation 

Validation: Deletion of CDC33(eIF4e), a S. cerevisiae gene in the same biochemical family, is lethal. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1063 Eukaryotic translation elongation 

factor 1 delta 
1073 Eukaryotic translation initiation 

factor 4A (eIF-4A) isoform 2 
1095 Human mRNA for KIAA0031 gene, complete 

cds 

1099 Human mRNA for KIAA0219 gene, partial 
cds 



5 D43950 
1 X74801 



3 7 221507 

1 18pll.2 D30655 

3 17,2 D21163 

3 12 D86973 



4) Translation Factor 

Validation: Deletion of CDC33(eIF4e) t a S. cerevisiae gene in the same biochemical family, is lethal. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1049 PEPTIDE CHAIN RELEASE FACTOR SUBUNIT 12 12 X81625 



4) Translation Initiation Factors 

Validation: Deletion of CDC33(eIF4e), a S. cerevisiae gene in the same biochemical family, is lethal. 



ID 


Name Variances 

Identified 


Chromosome 


Genbank 
Sequence 




1068 


Human translation initiation factor 
eIF-3 pllO subunit gene 


1 


16 


U46025 


1069 


EUKARYOTIC INITIATION FACTOR 4 A- LIKE 
NUK-34 


1 


17 


D21853 


1070 


Eukaryotic translation initiation 
factor 4C (eIF-4C) 


3 


i,x 


L18960 


1072 


Eukaryotic translation initiation 
factor 2A 


2 


14 


J02645 


1074 


Eukaryotic translation initiation 
factor 4E 


3 


14 


M15353 


1312 


Translation initiation factor 3 


1 


12 


U39067 



(eIF-3) p36 subunit 



4) tRNA Synthetases 



WO 98/41648 
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Validation: Deletion of ALA 1 ( Alanyl-tRN A synthetase), a S. cerevisiae gene in the same biochemical 
iamily, is lethal. 



ID 



Nana 



Variances 



Chromosome Genbank 
Sequence 


2 


16q22 


D32050 


1 


llplS.S 


L06645 


2 


7pl5 


U09510 


2 


9q21 


D28473 


3 




M27396 


3 


5pter-qll 


S80343 


1 


5pl3-cen 


M63180 


4 


9 


X59303 


1 


14 


M61715 



1031 Alanyl-tRNA synthetase 

1040 Cysteinyl-tRNA synthetase 

1079 Glycyl-tRNA synthetase 

1090 Isoleucine-tRNA synthetase 

1102 ASPARAGINE SYNTHETASE 

1121 Arginyl-tRNA synthetase 

1198 Threonyl-tRNA synthetase 

121B VALYL-TRNA SYNTHETASE 

1221 TRYPTOPHANYL- TRNA SYNTHETASE 



4) Ubiquitin and Ubiquitin Associated 

gene in the same biochemical family, is 



Validation: Deletion of UFD1 (Ubiquitin Fusion), a S. cerevisiae 
lethal. 



ZD 



Name 



Variances Chromosome GenbanJc 



1309 
1315 

1362 
13G3 
1420 
1431 

1511 

1514 
1515 



Ubiquitin carrier protein (E2-EPF) 
Cyclin-selective ubiquitin carrier 
protein 

UBIQUITIN CARBOXYL- TERMINAL HYDROLASE 3 
UBIQUITIN CARBOXYL -TERMINAL HYDROLASE T 
UBIQUITIN CARBOXYL- TERMINAL HYDROLASE 14 
UBIQUITIN CARBOXYL -TERMINAL HYDROLASE 
ISOZYME LI 

Ubiquitin A- 52 residue ribosomal 
protein fusion product 1 
Ubiquitin-conjugating enzyme E2I 
Ubiquitin fusion-degradation protein 
(UFD1L) 



Sequence 


2 


17 


M91670 


2 


17 


U73379 


2 


14 


D80012 


1 


12 


X91349 


4 


13 


M6B864 


2 


4 


X04741 


1 


19pl3 .l-pl2 


S79522 


6 


16pl3.3 


U45328 


4 


18 


U64444 



5) DNA Helicases 

Validation: Deletion of DNA2(DNA Helicase), a S. cerevisiae gene in the same biochemical family, is 

lethal. J 

ID Name - r . 

variances Chromosome GenbanX 

Identified Sequence 

1050 Human CHLl potential helicase (^HLRl)" 3 \V ~ m lll~-> 

complete cds 18 U33833 

1057 ATP - DE P ENDENT DNA HELICASE II, 86 KD 1 , M , no , B 

SUBUNIT 2 M3093B 



III' S^rsSn'^ helicase Q1 - like) ; »p«-p» 



L36140 
12 X86691 



5) DNA Polymerase 
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Validation: Deletion of POL2(DNA pol epsilon), a S. cerevisiae gene in the same biochemical family, is 



lethal 

ZD Name 



Variances Chromosome Genbank 
Identified Sequence 



1059 Human DNA polymerase delta small 3 " 

subunit mRNA, complete cds 12 U21090 
1105 DNA polymerase alpha subunit 1 

1 X,ll L24559 



5) DNA Replication 

Validation: Deletion of CDC45 (Chromosomal DNA Replication), a S. cerevisiae gene in the same 
biochemical family, is lethal. 



ID 



Name 



Variances Chromosome GenbanJc 
Identified 



Sequence 


1 


4 


D55716 


2 


3 


X67334 


2 


4pl4-pl3 


L14922 


2 


20ql2-ql3 .1 


J03250 


2 


17q21-q22 


J04088 


1 


17q21-q22 


D38073 


2 


lp3 5 


J05249 



1048 DNA REPLICATION LICENSING FACTOR CDC47 
HOMOLOG 

1094 Human mRNA for KIAA0030 gene, partial 
cds 

1124 Replication factor C (activator 1) 1 

(14SkD) 
1208 DNA topoisomerase I 

22 Topoisomerase II 
1222 Mini chromosome maintenance deficient 

(S. cerevisiae) 3 
1461 Replication protein A2 (32kD) 



5) Histone 



Validation: Deletion of CSE4(Similar Histone H3), a S. cerevisiae 
is lethal. 



gene in the same biochemical family, 



ID 



Name 



Variances 
Identified 



Chromoac 



GenbanJc 





Histone HI (0) 








1336 


Histone Hlx 


3 


22 


X03473 


1341 


HISTONE HID 


3 


22 


D64142 


1342 


HISTONE H2A.1 


5 


6 


XS7129 


1343 


Histone H2A.2 


4 


G 


U90551 


1344 


Histone H2B.1 


1 


6 


L19779 


1345 


H4 histone 


1 


1 


M60756 






1 


1 


X60486 



5) Polyadenylation and 3' Cleavage 

Validation: Deletion of FIP1 (Polyadenylation Factor), a S. cerevisiae gene in the same biochemical 
family, is lethal. 



ID Name 



Variances Chromosome GenbanJc 
Identified Sequence 



WO 98/41648 



169 / 214 



PCI7US98/05419 



1053 Human cleavage and polyadenylation 

specificity factor mRNA, complete cds 
1349 HNRNP METHYLTRANSF ERASE 
1426 Poly (A) -binding protein-like 1 



5) Purine/Pyrimidine Biosynthesis 

Validation: Deletion of CDC8(Thymidylate Kinase), a S. cerevisiae gene in the same biochemical family 
is lethal. 3y 



ID 



Name 



Variances 
Identified 



Chromosome Genbank 
Sequence 



1235 
1268 
1293 
1326 



1437 



1S10 
1S17 



1518 



ADENYLOSUCCINATE LYASE 
CAD PROTEIN 
CTP synthetase 
Phosphoribosylglycinamide 
f ormy 1 t rans f e r a s e , 

phosphoribosylglycinamide synthetase, 
phosphoribosylaminoimidazole synthetase 
Phosphoribosyl pyrophosphate 
amidotransf erase 
Thymidylate synthase 
Uridine monophosphate synthetase 
(orotate phosphoribosyl transferase 
and orotidine-5 1 -decarboxylase) 
Uridine Phosphorylase 



1 
2 

lp34.1 
21q22.1 



4ql2 

lBpll.32 
3ql3 



XS5867 
D78586 
X52142 
X54199 



U00238 



X02308 
J03626 



X90858 



5) Ribonucleotide Reductase 

Validation: Deletion of RNR1 (Ribonucleotide Reductase), a S. cerevisiae gene in the same biochemical 
family, is lethal. 

ID Name Variances Chromonoma GenbanJc 

Identified Sequence 

1452 RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE 4 n X59543 

Ml CHAIN 



5) RNA Helicase 

Validation: Deletion of BRR2(RNA Helicase), a S. cerevisiae gene in the same biochemical family, is 
lethal. 



ID Name 



Variances Chromosome Genbank 
Identified Sequence 



1100 Human mRNA for KIAA0224 gene, complete 4 16 D86977 

cds 

1163 DEAD/H (Asp-Glu-Ala-Asp/His) box l i L13848 

polypeptide 9 (RNA helicase A) 
1484 PUTATIVE ATP - DEPENDENT RNA HELICASE 3 19 U90426 

STE13 



WO 98/41648 
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5) RNA Polymerase II Components 
Valid*™ ft**, of RPAI35(RNA po, Subumt), a S. cere»i S i,e g,„e „ ^ « biochemica| ^ 



ID Nana 

Variances 



1026 Homo sapiens (clone mf.18) RNA 

polymerase II mRNA, complete cds 
108B Human RNA polymerase II subunic , 

(hsRPBlO) mRNA, complete cds 

1109 RNA polymerase II, polypeptide C (33JcD) 3 

1110 Polymerase (RNA) II (DNA directed) x 
polypeptide A <220kD) 

1165 DNA -DIRECTED RNA POLYMERASE II 23 KD o 
POLYPEPTIDE 3 
13 60 RNA polymerase II subunit hsRPB7 x 



3 



s GenbanJc 


19 


L37127 


19 


U37690 


16ql3-qq21 


J05446 


17pl3.1 


X63564 


17pl3.1 


J04965 


11 


U20659 



5) RNA Polymerase III 
VaUdatic: n*ta of RPA135(RNA po! Sub™,,. . S. cerevisiae gene in fc ^e biochercic,, 

ID Name 

Variances Chromosome OenbanJc 
beatified Sequence 

1170 Human RNA polymerase III subunit""" ~, " 

(RPC62 ) mRNA, complete cds 11 U93aG7 

5) RNA Splicing/Processing 

Validation: DdeUon of CUS1(U2 snRNP protein), a S. cerevisiae gene in the same biochemical family, is 

ID Nam* 

Variances Chromosome GenbanX 
Identified Sequence 

1171 Human spliceosome associated protein ~ " 
(SAP 145) mRNA, complete cds 

1172 Human splicesomal protein (SAP 61) 
mRNA, complete cds 

1176 H. sapiens mRNA for splicing factor 
SF3al20 

^11 * pUcing factor ' arginine/serine-rich 2 
1181 Human splicing factor SRp30c mRNA, 

complete cds 
1183 PRE -MRNA SPLICING FACTOR SRP75 
1216 SPLICING FACTOR U2AF 65 KD SUBUNIT 
1224 Human (clone E5.1) RNA-binding protein 

mRNA, complete cds 
1322 Fibrillarin 
1354 Heterogeneous nuclear 

ribonucleoprotein K 



1 


2 


U41371 


3 


2 


U08815 


1 


22 


X85237 


2 


4,17 


M90104 


1 


6 


U30825 


2 


1 


L14076 


1 


1 


X64044 


4 


1 


L37368 


1 


1 


X56597 


1 


9q2l.32-q21.33 


S74678 



WO 98/41648 
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1455 


Ul SMALL NUCLEAR RIBONUCLEOPROTEIN A 


3 


9q21.32-q21.33 


X0S347 


1460 


Ul small nuclear RNP- specific C 


2 


15 


X12517 


1473 


SnRNP core protein Sm D3 


2 


22 


U15009 


1474 


SnRNP core protein Sm D2 


5 


22 


U15008 


1477 


Ul snRNP 70K protein 


3 


19ql3 .3 


M22636 


1478 


Small nuclear ribonucleoprotein 


3 


20 


J04564 




polypeptides B and Bl 








1480 


Small nuclear ribonucleoprotein 


5 


15ql2 


U41303 




polypeptide N 







5) TATA-Binding Proteins 

Validation: Deletion of TAF145(TAFII Complex), a S. cerevisiae gene in the same biochemical family is 
lethal. 

ID Name Variances Chromosome GenbanX 

Identified Sequence 

1193 H . sapiens mRNA for transcription 

factor TFIID subunit TAFII28 
1196 Human TFIID subunit TAFII55 (TAFII55) 

mRNA, complete cds 
1199 TATA box binding protein 
1361 TBP-associated factor (hTAFII130) 



1 6 X8392B 

1 5 U18062 

2 6q27 M55654 
1 20 U7530B 



5) Transcription Elongation Factors 

Validation: Deletion of RP021(RNA pol Subunit), a S. cerevisiae gene in the same biochemical family, is 
lethal. 

ID Name Variances Chromosome Genbank 

Identified Sequence 

1077 TRANSCRIPTION ELONGATION FACTOR S-II 4 6 M81601 

4 TRANSCRIPTION ELONGATION FACTOR B3 5 5q31 L34587 

32 Elongin TCEB1 3 lp36.1 L4734 5 



5) Transcription Factors 

Validation: Deletion of BBPl(BFRlp binding), a S. cerevisiae gene in the same biochemical family, is 
lethal. 



ID 


N*»e Variances 

Identified 


Chromosome 


Oenbank 
Sequence 




33 


SUPT6H 


3 


17qll.2 


U46691 


1202 


Human TFIIA gamma subunit mRNA, 


1 


15 


U14193 




complete cds 








1205 


General transcription factor TFIIE 


1 


8p21-pl2 


X63469 




beta subunit, 34 JcD 






1206 


TRANSCRIPTION INITIATION FACTOR IIF, 


1 


8p21-pl2 


X16901 




BETA SUBUNIT 






1247 


CYCLIC -AMP -DE PENDENT TRANSCRIPTION 


1 


19pl3 .3 


X55544 




FACTOR ATF-1 






124B 


CAMP -dependent transcription factor 


3 


2 


M86842 
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1274 
1292 
1368 
1373 

1411 
1483 
1496 



1497 



1498 



1500 
1501 



ATF-4 (CREB2) 

Transcription Factor (CBFB) 
CRM1 protein 

Transcription Factor IL-4 Stat 
SIGNAL TRANSDUCER AND ACTIVATOR OF 
TRANSCRIPTION 1- ALPHA/ BETA 
Nuclear Factor I-B2 (NFIB2) 
Transcription Factor StatSb 
Transcription factor 12 (HTF4 , 
helix-loop-helix transcription factors 
4) 

Transcription factor 3 (E2A 
immunoglobulin enhancer binding 
factors E12/E47) 
Transcription factor 6 -like 1 
(mitochondrial transcription factor 
1-like) 

TRANSCRIPTION FACTOR P65 
Transcription factor COUP 2 (a.k.a. 
ARP1) 



2 
2 

21q21-q22.1 
21q21-q22.1 

19 
17 
15q21 



19pl3.3 



7p 



11 

15q26.1-q26.2 



L20298 
Y08614 
U16031 
M97935 

U85193 
U48730 
M83233 



M31523 



M62810 



L19067 
X91504 



6) Clathrin 

Validation: Deletion of RET 1 (Alpha-Cop), a S. cerevisiae gene in the same biochemical family, is lethal. 



Nam* 



Variances 
Identified 



Chromosome Genba&k 
Sequence 



1242 CLATHRIN COAT ASSEMBLY PROTEIN AP47 

1243 CLATHRIN COAT ASSEMBLY PROTEIN AP50 
1282 cell surface protein 

1290 Clathrin, light polypeptide (Lcb) 

1291 Clathrin heavy chain 



3 
22 

4q2-q3 
17qll-qter 



D38293 
U36188 
X83545 
M20470 
U417G3 



6) Cytoskeleton 

Validation: Deletion of MHP1 (Microtubule Interacting), a S. cerevisiae gene in the same biochemical 
family, is lethal. 



ID 



Name 



Variances 
Identified 



Chroxnosox 


oe Genbank 
Sequence 




8 


17pll-qter 


X04098 


1 


21 


U61 


4 


20 


S65738 


3 


2cen-q24 


M94345 


2 


7 


U03851 


1 


2q35 


J03191 


1 


9q34 


X04412 


5 




Y00503 


13 


5,12 


J00269 


1 


2 


X82207 


5 


llql3 


X95404 


1 


20 


M13451 


1 


lq42.1 


L25931 



1032 

1038 

1039 

1056 
10B0 
1092 
1093 
1267 
1284 
1383 
1385 



Ac tin, gamma Subunit 

Sh3pl7(Myosin IC Heavy Chain) 

Actin depolymerizing factor [human, 
fetal brain, mRNA, 1452 nt] 
Capping protein (actin filament), 
gelsolin-like 

Human capping protein alpha mRNA, 

partial cds 

Desmin 

Gelsolin (amyloidosis, Finnish type) 
Keratin 19 

KERATIN, TYPE II CYTOSKELETAL 6D 

BETA- CENTRACTIN 

Cofil in 1 (non-muscle) 

LAMIN A 

Lamin B receptor 
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1386 

1404 
1405 
140S 
1486 
1495 
1507 
1508 
1520 



MYOSIN LIGHT CHAIN ALKALI, NON-MUSCLE 
ISOFORM 

MYOSIN HEAVY CHAIN 95F 
MYOSIN HEAVY CHAIN IB 
Myosin- IC 

SUPPRESSOR OF TUBULIN STU2 
MICROTUBULE-ASSOCIATED PROTEIN TAU 
Tubulin, gamma polypeptide 
TUBULIN ALPHA- 4 CHAIN 
Myosin VI IA (USH1B) 



12,17 

4pl6.3 
13 
13 
11 
17 
17 
17 
17 



M22920 

U9023G 
D63476 
U14391 
X92474 
J03778 
M61764 
X06956 
U39226 



6) ER Protein 

Validation: Deletion of BETl(v-SNARE), a S. cerevisiae gene in the same biochemical family, is lethal. 

ID Name „ . _ 

variances Chromosome QenbanX 
_ _ _ Identified Sequence 

1272 Calnexin " 

1317 ER LUMEN PROTEIN RETAINING RECEPTOR 2 x 

1614 Ribophorin I a M88458 

1615 Ribophorin II ? 3c * Y00281 

1 20ql2-ql3.1 Y00282 



6) Integrin 

Validation: Deletion of MY02(Myosin Heavy Chain), a S. cerevisiae gene in the same biochemical 
family, is lethal. 

ID Name Variances Chromosome Genbank 
Identified Sequence 

1378 Integrin alpha-3 subunit [ ™™ ™™ 



6) Karyopherin 

Validation: Deletion of KAP1 21 (Karyopherin), a S. cerevisiae gene in the same biochemical family is 

lethal. J 

ID Name „ . ^ 

variances Chromosome QenbanJc 

Identified Sequence 

1091 karyopherin alpha 3 " " 

1214 transports (TRN) ? " D " 61B 

1 13 U70322 



6) Lysosomal Proteins 

Validation: Essential for sequestering and degrading aged or defective organelles and polymers that can 
interfere with cell survival, proliferation as seen by human diseases such as Tay-Sachs 
disease. 



WO 98/41648 
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ID Name „ . 

variances Chromosome GenbanJc 
Identified Sequence 

1265 ATPase H + transport ing/l'^c.^ \ Vte*x"vVl Xlllll 

(vacuolar proton pump) 31kD ^pter q.1.2 X76228 



6) MITOCHONDRIAL IMPORT 

Validation: Genes required to maintain inorganic ions at levels compatible with cell growth or survival. 

IP Name 

Variances Chromosome Genbank 
Identified Sequence 

1578 MITOCHONDRIAL IMPORT RECEPTOR SUB UN IT a 

TOM20 1 D13641 



6) Nuclear Pore Complex 

Validation: ^™rf^l(Ni«d« Pore Trafficking), a S. cerevisiae gene in the same biochemical 



ID 



Name 



Variances Chromosome GenbanJc 



Sequence 


3 


9 


D14689 


3 


llplS 


U41815 


4 


20 


L38696 


4 


12ql3.1 


X79536 


3 


6 


225535 


1 


11 


X58521 


5 


20 


U84720 


3 


12 


M29063 


5 


10 


D21262 



28 
29 
1266 

1350 

1355 
1425 
1449 
1454 

1524 



Nuclear Pore Complex NUP214 

Nucleoporin 98 

HETEROGENEOUS NUCLEAR 

RIB0NUCLEOPR0TE IN C 

Heterogeneous nuclear 

ribonucleoprotein Al 

Nuclear pore complex protein hnu P 153 

NUCLEAR PORE GLYCOPROTEIN P62 

Export protein Rael 

HETEROGENEOUS NUCLEAR 

RIBONUCLEOPROTE2NS C1/C2 

140 KD NUCLEOLAR PHOSPHO PROTEIN 



Validation: Deletion of BET3(v-SNARE associated), a S 
is lethal. 



6) Protein Transport 

cerevisiae gene in the same biochemical family, 



ID 



Name 



Variances 
Identified 



Chromosome Genbanx 
Sequence 



8 Integral Transmembrane Protein 
1467 Sec23A isoform 

1608 Signal recognition particle receptor 

{ 'docking protein' ) 
1613 TIM17 preprotein translocase 



llq23-24 
14 

Ilq23-q24 



L38961 
X97064 
X06272 

X97544 



6) Syntaxin 



WO 98/41648 
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Validation: Deletion of SED5 (Syntaxin), a S. cerevisiae 

ID Name 



gene in the same biochemical family, is lethal. 



Variances Chromosome Cenbank 
Identified Sequence 



lias syntaxin 1A 

1188 syntaxin 3 

1189 Syntaxin 5A 

1190 syntaxin 7 



21q22.1 
11 
11 
6 



L37792 
U32315 
U26648 
U77942 



6) Vacuolar Protein 

Validation: Ddenon of PPAl(Vacuo.ar H-ATPase), a S. cerevisiae gene in the same biochemical family. 



is lethal. 

ID Name 



Variances Chromosome Oenbank 
Identified Sequence 



1261 Vacuolar H + ATPase proton channel 2 1 

subunit 6 M62762 



6) Vesicle Proteins 



Validation: Deletion of SARl(COP H), a S. cerevisiae gene in the same biochemical family, is lethal. 



ID Name 



24 
1055 
1082 

1173 
1174 

1184 
1217 



Variances Chromosome Cenbank 
Identified Sequence 



1025 Human (chromosome 3p25) membrane 
protein mRNA 
COATOMER BETA SUBUNIT 
COATOMER DELTA SUBUNIT 
Human GP36b glycoprotein mRNA, 
complete cds 

SEC14 (S. cerevisiae) -like 
Human homologue of yeast sec7 mRNA, 
complete cds 

Human chromosome 17q21 mRNA clone LF113 
H. sapiens mRNA for vacuolar- type 
H(+) -ATPase 115 kDa subunit 



3,18 

3 

11 
5 

17q25.1-q25.2 
17q25.1-q25.2 

17 
17 



L09260 

X70476 
XB1198 
U10362 

D67029 
M85169 

U18009 
Z71460 



99) Direct Essential Yeast Homolog 

Validation: Deletion of the S. cerevisiae homologue of this gene is lethal. 



ID 



Name 



Variances Chromosome Oenbank 
Identified Sequence 



1238 
1239 
1241 
1271 

1300 



Aldolase A 

Aldolase B, f ructose-bisphosphate 
S-adenosylmethionine decarboxylase 1 
Calmodulin 1 (phosphorylase kinase, 
delta) 
DED81 



16q22-q24 
9q22 
6q21-q22 
14q24-q3l 

18 



M11560 
X02747 
M21154 
D45B87 

U79254 
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1301 
1306 

1318 
1332 
1333 
1359 
1367 
1372 
1396 
1399 
1414 
1415 
1433 
1446 
1519 



Deoxyhypusine synthase 
Dolichol monophosphate mannose 
synthase (DPMI) 
ESS1 PROTEIN 

Glucose phosphate isomerase 

Guanylate kinase (GUK1) 

Heat shock 60 kD protein 1 (chaperonin) 

PERIODIC TRYPTOPHAN PROTEIN 1 

IPP isomerase 

N-acetylglucosaminyl transferase I 
Mannose phosphate isomerase 
Nipl 

GLYCYLPEPTIDE N - TETRADECANO YLTRANS FE RAS E 
PHOSPHATIDYL INOSITOL 4 -KINASE ALPHA 
PERIODIC TRYPTOPHAN PROTEIN 2 
Uridine diphosphoglucose 
pyrophosphorylase 



3 


upx J . 1 1 -pi 3 . 12 


L3 9068 


2 


20 


AF007B75 


1 


19 


U49070 


1 


19ql3,l 


K03515 


3 


19ql3 .1 


L76200 


1 


9 


M34664 


1 


12 


L07758 


1 


10 


X17025 


4 


5q3l.2-q31.3 


M55621 


3 


15q22-qter 


X76057 


1 


5 


U15172 


2 


17 


M86707 


10 


17 


L36151 


2 


S 


U5334 6 


1 


2 


D27460 
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Target Variances by Field Table 
for Conditionally Essential Genes 

Conditionally Essential Biosynthetic Enzymes 

Validation: Conditionally Essential 

ID Name „ x 

Variances Chromosome Genbank 
Identified Sequence 

1536 5-methyltetrahydrofolate-homocysteine 3 

raethyltransferase U75743 

1539 Glutamate-ammonia ligase (glutamine 5 

synthase) b 1( 3 31 X59834 

Proteins that Repair Radiation Induced DNA Damage 

Validation: Conditionally Essential 

ID Name 

Variances Chromosome Genbank 
- _ Identified Sequence 

1541 Fanconi anemia complementation'group'c [ Y q H'l ^894 

Proteins of DNA Repair 

Validation: Conditionally Essential 

ID Name , f . 

variances Chromosome Genbank 
Identified Sequence 

1528 DNA excision repair protein ERCC5 4 " ~\^"~ 

1530 HHR23A protein 13c 3 33 D1S305 

1532 DNA EXCISION REPAIR PROTEIN ERCC-1 2 ,, nn , , , ' D2123S 

1533 DNA repair he li case ERCC3 ? 19ql3 . 2-ql3 . 3 M13194 
1537 URACIL -DNA GLYCOSYLASE 1 PRECURSOR \ 2q " M318 " 
1526 Damage-specific DNA binding protein 1 2 „ ,1 X15S53 

{127 kD) 2 11 * 1S AJ0029S5 

Proteins that repair chemically induced DNA damage 

Validation: Conditionally Essential 

ID Name „ , 

t^i/L n c i variances Chromosome Genbank 

1534 O-S-^ethylguanxne-DKAmethyUransferase 4 10q26 M60761 
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Target 


Loc'n 


Sequence around 


# Varia 1 


# Varia 2 Protein 


ID 






[polymorphs, am] 


(Lib) 


(Lib) Change 


1 


.01 




CGGCCATGTA [C/T J GTGGCCATCC 


71 (36) 


1 (1) 


Silent 




02 


250 


ACGAGGCCCA [G/A] AGCAAGCGTG 


71 (36) 


1 (1) 


Silent 




.03 


1003 


CGGGCATTGC [C/T] GACAGGATGC 


66 (35) 


6 (5) 


Silent 




.04 


801 


ACGAGCTGCC [C/T] GATGGCCAGG 


71 (36) 


1 (1) 


Silent 




.05 


1201 


AATGCTTCTA [ A/G] ACGGACTCAG 


71 (36) 


1 (1) 


Silent 




.06 


991 


CCACCATGTA tC/T] CCGGGCATTG 


17 (17) 


56 (35) 


Silent 




.07 


1099 


TGTGGATCGG [T/C] GGCTCCATCC 


71 (36) 


1 (1) 


Silent 




.08 


499 


GTGCTGTCCCT [C/G] TACGCCTCT 


65 (65) 


7 (7) 


Silent 


4 .01 


2168 


CCGCCAGTAG t C/T] ATCAGCTTTA 


61 (34 ) 


11 (9) 


3 'UT 




.02 


388 


TGGAAAGCCA ( C/T ] GGGGAGCCGA 


62(29) 


10 (7) 


Thr->Met 




.03 


491 


AGAGGAGAGA (T/C] GAGAGAAAGA 


68(36) 


4(4) 


Silent 




.04 


1171 


AAAACTAATT [T/C] GGATAGAAAG 


68(36) 


4(4) 


Leu->Ala 




.05 


336 


TCGGGATGCC [C/T] TGCAGAAGGA 


71 (36) 


1 (1) 


Silent 


5.01 


421 


ACGTCCCAAC [G/A] AAGAGACCAC 


66 (36) 


6(6) 


Silent 


e 


. 01 


1570 


CTCCGTCCA [T/C] TGTACTATCTG 


70 (36) 


2(2) 


Silent 




. 02 


778 


TCCACGTCCT [C/G] GTGCTGATGC 


71(36) 


1 (1) 


Silent 




.03 


158 


GGACACACTT [T/C] TGAAGCTTCT 


71(36) 


1(1) 


Silent 


9 


. 01 


1929 


CCATGCACCA [C/A] GAGGACTTTA 


71 C\R) 
/ x v j o ; 


1(1) 


His->Gln 


10 


.01 


1099 


AACCGTGTCAGGGAAACACCA 


69 (36) 


3 (3) 


Gly->Arg 


14 


.01 


911 


CAATTCAATC TG/A1 CCGCCCT AAA 


69 (36 ) 


3 (3) 


Arg->His 




.02 


1174 


CAAACAGTAA [G/A] TGAAAATGGT 


71 (36 ) 


1 (1) 


20 


.01 


1627 


CCCAGCACAT [C/T] ACCTATGTGC 


44 (30) 


28 (21) 


Silent 




.02 


2041 


GCCGAAGTGT I C/G] CGGTTCTCTG 


71 (36 ) 


1(1) 


Asp->Glu 




.03 


1393 


caqccatcca f c/t 1 aaaotcataa 


71 (36 ) 


1 (1) 


Silent 


22 


.01 


4008 


CAACAAAAAC [ A/C] AAATTCACAA 


71(36) 


1(1) 


Silent 




.02 


4446 


AGCCATCCAC [T/G] TCTGATGATT 


71(36) 


1(1) 


Silent 


24 


.01 


1101 


GCCACTGGCA [G/A] TAAAGGATAT 


71(36) 


1(1) 


Val->Ile 


28 


.01 


5009 


TGCCACGCCC [G/C] TGTTTGGGCA 


70 (36) 


2(2) 


Val->Leu 




,02 


2023 


AGAAATCACC [C/T] AGGATAACCC 


71 (36) 


1 (1) 


Silent 




.03 


2041 


CCCCTCCAGC [G/A J GCAAAGCCAG 


71 (36) 


1 (1) 


Silent 


29 


.01 


1768 


CCCTGCCACT [A/C] GAGTCCGGCC 


67 (36) 


5(5) 


Silent 




.02 


2781 


AGGAGCATCC [G/A] TCTAAAACTA 


70 (36) 


2 (2) 


Silent 




.03 




2 bp deletion 






3 'UT 


32 


.01 


1171 


AAAACTAATT [T/C] GGATAGAAAG 


70 (36) 


2(2) 


Leu->Ala 




.02 


388 


TGGAAAGCCA [C/T] GGGGAGCCGA 


59(33) 


13 (10) 


Pro->Met 




. 03 


2168 


CCG CCAGTAG CATCAGCTTTA 


60 (34) 


12 (10) 


Silent 


33 


.01 


2397 


GGCTAGATGG [T/C] CTGGCCAAAA 


47(33) 


25(12) 


Silent 




.02 


3708 


AGGTCGGGGT [C/T] GATGTCAACC 


63 (35) 


9(8) 


Silent 




,03 


3795 


GGACCCACCT [C/A] CTGAAGATCC 


62 (35) 


10(9) 


Silent 


524 


.01 


1598 


CACAAGTTGA [G/A] GAGGGCGATA 


68(36) 


4 (4) 


Silent 




.02 


2548 


CTTATATTTC [ T ] 10 GATGTCAACC 


71(36) 


1(1) 


3 'UT 




.03 


3158 


AAAATTGTCT [GTTT] GTTTTCTCAT 


50 (34) 


22 (20) 


3 1 UT 


525 


.01 


255 


CTGCGGTTCT [C/T] GAGGGCGATA 


54 (34) 


18(16) 


Silent 




.02 


346 


CGTGCCGGCT [C/T] TTCACCATCC 


71 (36) 


1(1) 


Leu->Phe 




.03 


523 


CCCCATCCTC [A/G J TCCCGTGCCA 


63 (36) 


9(9) 


Ile->Val 



1025.01 1051 CAACTAACCA [G/A] ACAACTGGGA 24 (20) 48 (44) 3'UT 
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.11 


418 


GCCCCTTTTG [C/T] AGCCCACGGC 


6 


(5) 


5 


(3) 


N/D 




.12 


640 


CAACTAACCA [G/A] ACAACTGGGA 


15 


( 7) 


7 


( 6) 


N/D 


1026 


.2 


47 


GTCTGGACGC [G/A] ACGGCGGCGG 


2 


( 2) 


3 


( 2) 


5' UT 




.9 


262 


LuLALCCCTT i G/A] GAGCACAAGA 


28 


(13) 


4 


( 1) 


Silent 




.19 


602 


ATAAAGTATAG CGG [A/G] AGAGAN 


5 


( 5) 


11 


( 8) 


3 1 UT 


1027 


.2 


405 


TGGAAGAGAT [T/C] ATTGATGACA 


2 


( 2) 


2 


( 2) 


Silent 




.6 


942 


GGACAAAAAG [A/G] TATGACTCCA 


8 


( 8) 


4 


( 4) 


Silent 




.16 


1361 


CAGGAAGGCA [C/AJ CCCTGAGGGG 


13 


(11) 


3 


; 3) 


Thr -> 


1031 


.31 




CCTTCGCCCA [G/A] CTGCGCCTCG 


9 


( 7) 


2 


( 2) 


Silent 




.32 


2991 


CTTCGCCCAG [C/Q] TGCGCCTCGG 


4 


( 4) 


4 


( 4) 


Leu - > 


1032 


.1 


3 


AGTCGCCG [G/A] GGAGGACGGTCT 


5 


( 4) 


3 


f 3 ) 


3 U 1 




.2 


4 


GTCGCCG [G/A] GAGGACGGTCTGC 


5 


( 5) 


3 


( 2) 


5' UT 




.3 


£9 


CCGCCGCGGC [G/A] AAGATGGCCT 


5 


( 5) 


2 


( 2) 


5' UT 




.10 


312 


AAAAAGATTG [T/C] CGCTATGCTT 


8 


( 8) 


3 


( 3) 


Silent 


1037.20 


2919 


TGGTTATGGG [G/C] GTGCCAGAGG 


15 


(13 ) 


2 


l * ; 


3 ' UT 


1038 


. 5 


723 


CAGGTCCTGG [G/C] CCCCAAGCCT 


7 


( 7) 


3 


( 3) 


Silent 




.10 


662 


ACTCCAGCCC [C/A] TTTGCCCTTG 


5 




X J 


ml 


Silent 




. 13 


1053 


CCTCAGGGCC [G/A] TGAGAGTCCC 


13 


(10 ) 


g 


^ \ 

\ f i 


Arg - > 


103 9 


.19 


1665 


ACCATGTCTC [A/G] GTTTATTTTT 


2 


( 2) 


6 


i 5) 


3' UT 




.23 


1748 


TATTTGAGTA [G/A] AAAATCACTT 


3 


\ J / 


2 


"3 \ 
* I 


3 ' UT 


1040 


.7 


2056 


GCTGAAGAAG [T/C] CTTCGAGGCT 


20 


(16) 


2 


( 2) 


3 1 UT 


1043 


.1 


351 


ACTTGAAGGA [T/C] GAAAGTGGCT 


2 


( 2) 


3 


( 3) 


Silent 




.2 


372 


TCAAAGATCC [C/T] TCCAGCGACT 


2 


( 2) 


3 


( 3) 


Silent 


1048 


.3 


341 


GCTACGCGAA [G/A] CTCTTTGCTG 


2 


( 2 ) 


2 




Silent 


1049 


.10 


2648 


CCTGAAACCC [T/A] GAAGCTGATG 


5 


( 4) 


3 




1 1 TTT 
J U I 




.12 


2768 


CAGTGGTAGC [G/A] ATGGAAAAAA 


8 


( 6) 


2 


f 1 ) 


3 1 UT 


1050 


.11 


2381 


CAGGAAGAAG [A/G] TATTCCAGGA 


4 


( 2) 


2 


( 2) 


lie - > 




.13 


2750 


TTTTGCCAGC [G/A] TAGTGCTCCT 


2 


( 2) 


2 


( 1) 


Val - > 




.14 


3034 


GAGTCCAGAG [T/C] GCTGCCAGGA 


2 


( 2) 


2 


( 1) 


3 ' UT 


1051 


.10 


260 


AGCTGGCAAG [C/T] TACTTTTCAG 


15 


(10) 


3 


( 1) 


3 ■ UT 




.18 


409 


TTTGCTTCTT [G/A] AGTAGAGCCA 


17 


(12) 


3 


( 1) 


3 1 UT 


1052 


.7 


428 


TGTACAAATC [T/C] TT CATC CAT A 


7 


( 6) 


2 


( 2) 


3 ' UT 


1053 


.24 


4113 


AGGAGAAGAC [C/T] TACCGGCGGC 


8 


( 7) 


8 


( 8) 


Silent 


1055 


.17 


3122 


CAGCGTCAGC [C/A] AGCTCAGCCT 


4 


( 4) 


4 


( 4) 


3 • UT 




.23 


3450 


TGAGAAGGGC [T/C] TGGGACAAGA 


26 


(12) 


3 


( 3) 


3 • UT 




.25 


3568 


TCAAAAAACC [T/C] TTTTTTTCTG 


26 


(12) 


2 


( 2) 


3' UT 




.01 


2061 


AGGCTGGTCG I C/T] GAACTCCTGA 


61 


(34) 


11 


(9) 


3 'UT 




.02 


2419 


TTAAAAGATA [C/A] GCATGTCTTC 


59(33) 


13 £10) 


3'UT 




.03 


3047 


TAAGTCTTTT [G/T] AGTGTCATCA 


71(36) 


1 


(1) 


3 1 UT 




.04 


2960 


TATTACTCAC [G/A] TATACCCCAT 


71(36) 


1 


(1) 


3 'UT 




.05 


3450 


TGAGAAGGGC [T/C] TGGGACAAGA 


60(33) 


12 


(9) 


3 'UT 




.06 


3296 


CTGCAAAGAG [T/C] GTACTGTGCT 


60(33) 


12 


(9) 


3 'UT 


1056 


.12 


407 


CAAGAGCACC [G/C] GTGGGGCCCC 


13 


( 9) 


2 


( 2) 


Val -> 


1057 


.20 


3067 


TAACTTTTCG [G/A] TCTTTCCCAT 


7 


( 5) 


3 


( 3) 


3' UT 


1059 


.11 


1130 


AACGTGAGTG [A/G] CATTTTCCGA 


5 


( 5) 


2 


( 2) 


Asp -> 



His 
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Target Loc'n Sequence around # Varia 1 # Varia 2 Protein 

ID [polymorphism] (Lib) (Lib) Change 





19 


1327 


AATCATCCGA [G/A] GTCCTGAGGA 


19 


(14) 


3 


( 3) 


Val -> 


Ser 




27 




(jWjAtj(jL.CTG iU/AJ GGCTGGGCCC 


15 


(11) 


2 


( 2) 


Gly -> 


Arg 


1063. 


21 


705 


CGGACATGGC [C/T] CAGCTGGAGG 


8 


( 7) 


8 


( 7) 


Silent 






22 


721 


GGAGGCCTGT [O/T] TGCGCTCTAT 


16 


(14) 


2 


( 2) 


Val -> 


Leu 




38 


94 9 


GCGTGCGTGA [G/A] GGCCCTGCCA 


2 


( 2) 


2 


( 2) 


3 ' UT 




1068 . 


30 


2756 


GCGCCGCGGT [O/CJ GCTACCGCCA 


21 


(15) 


2 


( 2) 


Ala -> 


Arg 


1069. 


10 


1199 


GGGCGCCAGC [C/O] GAGTGCTTAT 


17 


(13) 


2 


( 2) 


Arg -> 


Glu 


1070. 


3 


303 


AAGAGGATGG [G/T] CAGGAGTATG 


3 


( 2) 


6 


( 6) 


Gly -> 


Val 




7 


615 


ACATTGGAGA (T/C) GATGATGAAG 


6 


( 6) 


2 


E 1) 


Silent 






12 


1092 


GAAGTCTGCA [Q/T] TTGAAGAAAA 


5 


( 5) 


3 


( 3) 


3 ' UT 




1072 


20 


1309 


TCACGAGATT (T/C) GCCAGGGGCA 


15 


(10) 


2 


( 2) 


3' UT 




.21 


1310 


CACGAGATTT [G/T] CCAGGGGCAT 


4 


( 3) 


5 


( S) 


3 ' UT 




1073 


2 


65 


GGCCCAGAGG [G/A] AATGGACCCC 


2 


( 1) 


2 


( 2) 


Silent 




1074 


18 


1428 


TTGTGTGATT [T/C) CCTAAACATA 


5 


( 4) 


2 


( 2) 


3 ' UT 






21 


1650 


TTGTCTTTTA [G/A] ACAACTAGAT 


6 


( 6) 


3 


( 3) 


3' UT 






22 


1652 


GTCTTTTAGA [C/A] AACTAGATTT 


5 


( 5) 


3 


( 3) 


3 1 UT 




1077 


.19 


1275 


TATAATAATT [G/T] TATGGTACCT 


3 


( 2) 


3 


( 3) 


3' UT 






.22 


1585 


ATGTACATAA [T/A] TTTGAGGTAG 


7 


( 5) 


3 


( 1) 


3 * UT 






.30 


2336 


TCAGGCACCC [A/G] TAGAAAGACC 


4 


( 3) 


10 


( 9) 


3 ' UT 






.34 


2460 


GAATTGGCCC [0/A] CTGGTACCAA 


5 


( 4) 


16 


(14) 


3 ' UT 




1079 


.11 


2035 


CTGCTGTAGT [T/C] GCTCCATTCA 


19 


(14) 


2 


( 1) 


Silent 






.18 


2347 


GCAACATCAC ( A/G] TGGGCTGATG 


25 


(17) 


2 


( 2) 


Silent 




1080 


.24 


2367 


TGCCTGAGGAtA/C] GGGCAGGGCC 


1 


( 1) 


5 


( 4) 


3' UT 




1081 


.17 


805 


GATTGATAGA I G/A] AGAAACTGCG 


13 


( 8) 


2 


( 1) 


Ser -> 


Lys 




.36 


1178 


ATGCATATTGTAAAATAAA [A/G] A 


2 


( 2) 


10 


( 9) 


3 ■ UT 




1082 


.19 


767 


TTCGGGGCCT [C/T] CGCCGGCACC 


7 


( 5) 


2 


( 2) 


Ser - > 


Phe 




.27 


924 


ACGTGGACGA[C/A] CCCACGGGGA 


3 


( 3) 


3 


( 3) 


Asp -> 


Glu 




.40 


1333 


GTCTACAGAT [Q/T] GGCTGTGGCC 


4 


( 4} 


5 


( 5) 


3 ' UT 




1088 


.11 


112 


CCGAGGGGGA [C/T] GCGCTGGATG 


23 


(16) 


7 


( 5) 


Silent 






.12 


144 


AAGCGCTACT [G/C] CTGCCGCCGG 


24 


(18) 


5 


( 4) 


Cys - > 


Ser 




.13 


145 


AGCGCTACTG [C/G] TGCCGCCGGA 


21 


(16) 


5 


( 4) 


Cys -> 


Trp 




.20 


226 


GACCACGCTG [A/G] AACCCACCCA 


23 


(16) 


18 


(11) 


3* UT 






.21 


238 


ACCCACCCAC [C/A] CGCTGTGCTG 


31 


(19) 


3 


( 3) 


3' UT 






.24 


270 


TGAGCGTCCT [A/G] CCCCGAATTC 


29 


(18) 


9 


( 6) 


3' UT 






.27 


338 


GTGTGTATCC [C/G] ATACCCCACT 


23 


(15) 


2 


( 2) 


3' UT 




1090 


.18 


4153 


GTGTAAAATA [T/C] GCTGCTTGGA 


13 


(12) 


2 


( 2) 


3* UT 






.21 


4215 


CT CACAGTAA [T/C] CTTCACACTT 


21 


(16) 


2 


( 1) 


3' UT 




1091 


.3 


793 


AGGATCCCCC [A/G] CCGCCTATGG 


2 


( 1) 


5 


( 2) 


Silent 






.9 


962 


CTTTCTTGTG [C/T] CCCTTCTGAG 


4 


( 3) 


5 


( 2) 


Pro -> 


Ser 




.14 


2078 


AAGAGGTGCAtA/G] TGTGATCTGA 


6 


( 5) 


11 


( 8) 


3- ut 




1092 


.5 


342 


CCTGGAGGCG [G/C] CCAACGGCGA 


16 


( 8) 


4 


( 1) 


Ala -> 


Pro 




.10 


401 


GGCCTGGGCC [C/T] TCCCGCGACT 


9 


( 6) 


11 


( 5) 


Silent 






.11 


503 


AGATCGACAA [C/T] GCCCGTCTGG 


11 


( 6) 


6 


( 5) 


Silent 






.22 


1034 


TTGGAGCCCA [G/C] CTGGCGCATA 


4 


( 4) 


3 


( 2) 


Gin -> 


His 
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Target Loc'n Sequence around # Varia 1 # Varia 2 Protein 

ID [polymorphism] (Lib) {Lib) Change 



.23 103 5 TGGAGCCCAG [C/Ol TGGCGCATAT 3(3) 3(2) 



1093.2 


258 


CTCTCACAGA [C/T] GAGATCAACT 


3 


( 2) 


2 


( 1) 


Silent 




.3 


330 


CAGACACATC [T/C] GTGGTGCTGT 


3 


( 2) 


3 


( 2) 


Silent 




.4 


339 


CTGTGGTGCT [G/A] TCCATGGACA 


3 


( 2) 


3 


( 2) 


Silent 




.6 


420 


TTGCTCAGAG [A/G] AGCCGGGCTG 


3 


( 2) 


3 


( 2) 


Silent 




.22 


954 


GCGTTGGAGG [T/CJ GGCTTCAGTT 


7 


( 2) 


3 


( 1) 


Val -> 


Ala 


.23 


960 


GAGGTGGCTT [C/T] AGTTCCAGCA 


7 


( 2) 


3 


( 1) 


Silent 


.24 


972 


GTTCCAGCAG [T/CJ GGCAGAGCCA 


7 


( 2) 


3 


( 1) 


Silent 




.27 


983 


GGCAGAGCCA [T/C) TGGGGGTGGC 


7 


( 2) 


3 


( 1) 


lie - > 


TV"i *- 

inr 


.28 


1065 


GGAAGAGCTA [T/C) AAGCACTAAA 


9 


( 3) 


3 


[ 1 ) 




.44 


1198 


TAGAGCTGGG [G/T] ATGAATGCTT 


13 


( 2) 


3 


( 1 \ 


3' UT 




.45 


1202 


GCTGGGGATG [A/G] ATGCTTAGTG 


13 


( 2) 


4 


( 1) 


3' UT 




. 49 


1579 


TGTGCTCTTC [A/G] CTCTTTGCAA 


14 


( 3) 


5 


( 2) 


3' UT 




.50 


1582 


GCTCTTCACT [C/G] TTTGCAATTG 


13 


( 3) 




\ J t 


3' UT 




1094.24 


3103 


TGCTTTTGCT [C/G] GCTTTGGCCA 


15 


( 9) 


4 


I 2 ) 


3' UT 




.25 


3104 


GCTTTTGCTC [G/C] CTTTGGCCAG 


2 


( 2} 


4 


( 2) 


3' UT 




1095.17 


2885 


CGTAGGAAGG [G/C] CCTCAGTGAA 


18 


(11) 


2 




Silent 




.25 


2994 


GTGGACTCCT [G/T] GGAGCTCCTG 


14 


(10) 


3 




3' UT 




.31 


3246 


GGGGATGAAA [C/A) CCCAAGGGGC 


10 


( 7) 


12 


(11) 


3' UT 




1098. 10 


1486 


GGCAGTGGCC [G/C] CCCTGGGTGA 


8 


( 7) 


3 


( 3) 


Ala -> 


Pro 


. 13 


1522 


CACGTATGAG [G/C] ACATCCAGAC 


2 


( 1) 


12 


(10) 


Asp - > 


His 


.21 


1740 


TGCATTCTTT [T/C] GGAACTCAAT 


11 


( 6) 


2 


( 2) 


3' UT 




.25 


1850 


GGAGGGCGGT [C/T] GGTGCTTCCC 


21 


(13) 


2 


( 2) 


3 ' UT 




.29 


1942 


TGACCTATCA [A/G] AGCCTCCCGG 


16 


(11) 


6 


( .5) 


3' UT 




. 3 5 


2 029 


CCAAGGAGCG [C/A] GCTCCACGCG 


13 


(10) 


2 


( 2) 


3 ' UT 




1099.36 


7590 


TGGTTTGAGA [ G / C ] CTGGCGCTAC 


12 


(11) 


6 


( 4 ) 


3' UT 




.37 


7591 


GGTTTGAGAG [C/G] TGGCGCTACC 


9 


( 8) 


6 


( 4 ) 


3' UT 




.44 


7705 


ATGGATCTGA [C/T] CCCTGTCAGA 


13 


(12) 


9 


( 8) 


3' UT 




. 01 


215 


ATTCCTCAGT [C/T] CTTCATGATG 


63 (36) 




9(9) 


Ile->Val 


.02 




Nucleotide repeat 


66 


(35) 




6 (5) 


3 'UT 




1100.16 


3865 


ATTGGGTCCT [C/G] AGCCTTCTGG 


4 


( 3) 


4 


( 3) 


3' UT 




.17 


3904 


GGACAAAGCC [T/C] TTTCATCTGA 


2 


( 2) 


4 


( 3) 


3' UT 




.19 


3994 


GGTGGAGTTC [T/C] TCCATGCAGG 


6 


( 6) 


e 


( 5) 


3» UT 




.22 


4046 


TATCCGAGGT [G/T] CTGCCGGGGC 


6 


( 6) 


5 


( 5) 


3 * UT 




1102.29 


1967 


TAACTTGGGT [T/G] TGAAAAAAAT 


2 


( 1) 


25 


(20) 


3 ' UT 




.30 


1982 


AAAAATAAAA [T/G] TCCTAAATTT 


2 


( 1) 


24 


(20) 


3 ' UT 




.31 


1991 


AAAAATAAAATTCCTAAAT [T/C] T 


2 


( 1) 


21 


(17) 


3' UT 




1105.15 


2038 


GGGCCTGCCT [G/C] TGAGTGGTGC 


3 


( 3) 


6 


( 6) 


3' UT 




1109.4 


884 


AGCTTGCCTG [C/T] TTCAGCAAAA 


4 


( 4) 


2 


( 1) 


3' UT 




1110.11 


6466 


CTGATGCAGA [T/C] TCTTGTCTTG 


5 


( 5) 


5 


( 5) 


3' UT 




1111.8 


794 


AAGACGGCTA [T/C] GAGTTCTTTG 


2 


( 1) 


7 


( 6) 


Silent 




.15 


1087 


CTGCCATGCT [G/T] GGGGGGGGTC 


8 


( 5) 


4 


( 4) 


3' UT 




.16 


1110 


CCCGACCCCT [A/C] AGGCCCACCT 


3 


( 1) 


18 


(17) 


3' UT 




.17 


1146 


GAGCCTTGGT [G/T] TATTTTTCTT 


22 


(18) 


4 


( 4) 


3 ' UT 




1114 .18 


540 


ATGCTACCTA [C/T] CGGGAAGGCA 


29 


(16) 


2 


( 2) 


Silent 




.20 


585 


TCACTGCCAA [T/A] GCTCTCGCTT 


22 


(15) 


6 


< 4) 


Asn -> 


Lys 


.21 


586 


CACTGCCAAT [G/T] CTCTCGCTTT 


16 


(12) 


6 


( 4) 


Ala -> 


Ser 


.27 


704 


CCCAAATTCG [C/T] CGTTGCCACT 


20 


(14) 


3 


( 3) 


Ala -> 


Val 


.01 


177 


GAACAACCAC [T/C] GGGTCCTACA 


70(36) 


2 


(2) 


Silent 




.02 


328 


ACTGAATGAG [C/G] CTCCACTGGT 


71(36) 


1 


(1) 


Pro->Ala 
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.03 328 GGCCGGAGGC [A/G] TTCACTCCAG 30 (20) 42(32) Silent 
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ID Ipolymorphiaa] (Lib) (Lib) Change 



.5 


130 


CTTCCAAAGG [T/Cl CCGGAAAACT 


8 


( 7) 


14 


t A \ 

I 4 ) 


Val -> 


Ala 


.15 


643 


TTCAACGACC fT/Cl GGGCTrrnnA 


11 


( 8) 


2 


( 1) 


Leu - > 


Pro 


.16 


732 


CAAGAAGGGG [A/C] CCAGGCTTGG 


12 


( 7) 


4 


( 2) 


Thr -> 


Pro 


1116.2 


121 


CGGACCGTCC TT/Al f3A(""par , ar"rr 


2 


t 1) 


4 


( 4) 


Silent 




.3 


173 


CCGGGGAATG [A/C] AGCCCACAGA 


2 


< 1) 


5 


( 5) 


Lys -> 


Gin 


1117 . 1 


IS 


V-V»lVjV-AL»L-L.t. LT/CJ GGCCTTCCGC 


10 


< 7} 


4 


( 3) 


5* UT 




.2 


16 


CTGCAGCCCT [G/T] GCCTTCCGCC 


10 


< 7) 


4 


( 3) 


5' UT 




.5 


19 


CAGCCCTGGC [C/T] TTCCGCCACC 


10 


( 7) 


2 


( 2) 


5' UT 




. 19 


401 


^ >jvj\»_fvjv,v^ 1 1 IV*/ A J (j^LAAWjCCC 


12 


( 7} 


8 


( 4) 


Leu - > 




.01 


1287 


«v.UiiwUALl It./ t*J ACCAACGCCA 


65 (36} 




7 (7) 


Ser->Val 


.02 


338S 


TTGCCTGGAC [G/A] TTGGCCTGCG 


70{36) 




2 (2) 


3 'UT 




1118 .5 


1681 


wiu\i\j(jl iu IU/AJ TTATGCACAA 


6 ' 


( 5) 


2 


( 1) 


Val -> 


Asp 


.28 


2945 


ATGATTAAGG [A/G] CCAGAGGATC 


7 


( 6) 


7 


( 5) 


3' UT 




1119 . 11 


1075 


iUftt-AAATTA IG/AJ GCCACGGCCC 


3 


( 3) 


3 


( 3) 


3' UT 




1121 . 17 


1524 


CATCCGTTGC ( A/G] TATGGCTGCA 


3 


( 3) 


2 


( 2) 


Silent 




.23 


1669 


rtaCACGTCTG [G/C] CCAATATTGA 


6 


( 6) 


3 


( 3) 


Ala - > 


Pro 


.27 


1902 


GACAGACTGG [G/A] AAAATATTGA 


2 


( 2) 


20 


(17) 


Gly -> 


Glu 


1123 .9 


2485 


CCTGATATGA [A/C] TGTTACTAAA 


5 


< 5} 


4 


( 4) 


Asn - > 


Thr 


.17 


2807 


TTGACATAAC [T/C] ATCTTTTTGA 


4 


( 3) 


3 


( 3) 


3 * UT 




1124.2 


119 


TCTTATCGGA [G/A] CTTGTATGTG 


2 


( 1) 


3 


( 3) 


5 ' UT 




.7 


3616 


TACTCCATAC [G/T] CACTTCAAGC 


2 


t 1) 


5 


( 3) 


Ala - > 


Ser 


1127 . 2 


4 


HjUAAAA LU/AJ CGCAGGATCAAGG 


13 


( 8) 


2 


( 1) 


mx a - > 


Thr 


.15 


75 


TCAACATCTG [T/C] GTTGGGGAGA 


22 


(14) 


2 


< 1) 


Silent 




.34 


339 


AGGAACACAT [T/C) GATCTGGGTA 


2 


( 2) 


31 


(16) 


Silent 




112B . 9 


483 


WAliWuWiAAAA LA/ wj AAAACCC 


4 


( 3) 


4 


( 3) 


3' UT 




. 10 


4 84 


/UWilAAAAAAAAAA IA/TJ AAACJCC 


4 


( 3) 


4 


( 3) 


3' UT 




1130 . 7 


248 


LW-LLluLW; I G/T J TGAAGAACTT 


25 


(12) 


9 


( 4) 


Val -> 


Leu 


. 11 


320 


uoaaialuWj [U/ T J ACCTGACCAC 


26 


(12) 


2 


( 1) 


Asp -> 


Tyr 


.13 


364 


/a^-^_o>vj/\L_rt l LU/ 1 J In i.Tjt-H-WjL. 


15 


(10) 


3 


( 2) 


Met -> 


lie 


. 16 


377 




16 


( 8) 


4 


( 3) 


Arg - > 


Ala 


. 19 


421 


1 UWtkjrlJiUjA 1 1 V_/ 1 J IjwuU 1 UtuUi 


12 


( 7) 


2 


( 1) 


Silent 




1131.12 


502 


TGGCTGACCA [G/A] GCTGAGGCCC 


18 


(13) 






Silent 




1133 .20 


279 


CTGAGTCTGC [C/T] ATGAAGAAGA 


41 


(IB) 


2 


( 1) 


Silent 




.35 


517 


CCTAATTCTG [A/G] ATATATATAT 


19 


(12) 


4 


( 2) 


3* UT 




1135.22 


301 


AAAACAAGAC [T/G] GGGGCTGCTC 


38 


(20) 


8 


( 4) 


Silent 




.23 


343 


CGGGCTACTA [C/T] AAAGTTCTGG 


40 


(18) 


4 


( 2) 


Silent 




.32 


438 


AAGAGTGTTG [G/A] GGGGGCCTGT 


32 


(18) 


2 


( 2) 


Gly -> 


Ser 


1136.1 


13 


CGCCGCTGCG [G/A] AGGGAGCCGC 


9 


( 9) 


10 


( 6) 


5' UT 




.16 


190 


GGAGCCGGCA [G/A] CCGACGGCAA 


31 


(21) 


5 


( 4) 


Ala -> 


Thr 


.18 


197 


GCAGCCGACG [G/C] CAAAGGTGTC 


32 


(23) 


5 


( 5) 


Silent 




.19 


198 


CAGCCGACGG [C/A] AAAGGTGTCG 


21 


(16) 


8 


( 5) 


Ala -> 


Glu 


.23 


243 


GCCAGCGGAA [G/C] CCTGCCACCT 


31 


(20) 


5 


( 5) 


Lys -> 


Asn 


.24 


244 


CCAGCGGAAG [C/G] CTGCCACCTC 


31 


(20) 


5 


( 5) 


Pro -> 


Ala 


.25 


245 


CAGCGGAAGC [C/T] TGCCACCTCC 


31 


(22) 


6 


( 3) 


Pro -> 


Leu 


.29 


283 


CAACAAGAAT [G/C] CTCGCGCCAC 


26 


(18) 


5 


( 5) 


Ala -> 


Pro 


.30 


284 


AACAAGAATG [C/G] TCGCGCCACG 


26 


(18) 


5 


( 5) 


Ala -> 


Val 


.32 


286 


CAAGAATGCT [C/T] GCGCCACGCT 


31 


(22) 


2 


( 2) 


Arg -> 


Cys 


.41 


387 


TCCTGCGCAC [G/C] CAGAAGCCTG 


2 


( 2) 


19 


(14) 


Silent 
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Targe t Loc 




Sequence around # Varia 


1 


# Varia 2 


Protein 






XD 






[polymorph! sm] { Lib ) 




f Lib} 


Change 






1137 


.1 


3 


CTTCCTTC [G/T] AGGAGGTGGCAG 


2 


( 2) 


3 


( 2) 


5' UT 






. 15 


331 


GTGCCGAGAT [C/T] GCTCACAATG 


22 


(12) 


4 


( 2) 


Silent 






, 23 


419 


CAATGCCAGG [C/Q] TGCGCAGTGA 


13 


( 9) 


3 


( 2) 


Leu - > 


Val 




. 25 


488 


TAAAAACTGC [C/A] ATCTGGCATC 


8 


( 8) 


4 


( 4) 


3 ' UT 




1138 


.8 


78 


AGGAGGAGCT [G/T] CTGAAACAGC 


30 


(17) 


2 


( 2) 


Silent 






.14 


127 


GCTGCGCGTC [G/A] CCAAAGTGAC 


3 1 




2 


( 2) 


Ala - > 


Thr 




.24 


354 


AGCAGCAGCG [G/T] AAGGAGCGGC 


28 


(16) 


2 


( 2) 


Silent 




1139 


.21 


334 


TTCCGAAGCA [A/G] TCTTCCTGCT 


33 


(20) 


3 


( 1) 


Asn - > 


Ser 


1140 


. 3 


17 


CCGCTGCTCG [C/A] CATGTCTTCT 


22 


(15) 


3 




5 ' UT 






. 20 


341 


AATATGTAAG [G/A] CCTTTCTTTT 


32 


(16) 


2 


( 2) 


3' UT 




1141 


.5 


201 


ATCAGACTAG [A/T] GCTGAGTCTT 


2 


( 1) 






Arg - > 


Ser 




.7 


346 


GCGCCGTTGG [C/A] ATCGTAGAGT 


4 


( 3) 


3 


( 2) 


His -> 


Asn 




. 18 


1071 




5 


( 4 ) 


6 


( 3) 


Silent 






.21 


1376 


TGTTATACAGG CAG TG A (G/A] AAA 


14 


(10) 


5 


{ 4 \ 


3' UT 




1142 


. 13 


556 




8 


( 7) 


3 


t "i ) 
\ j i 


Asp - > 


Ala 


1143 


. 17 


470 


ATCTACAAGC [G/T] TGGTTATGGC 


32 


(20) 


2 


( 2) 


Arg - > 


Leu 


1144 


.1 


211 


GCCGCGGCGC [G/C] CCCCTCGCCA 


7 


( 5) 


4 


( 4) 


Silent 






.5 


2B6 


CCGCCGAGGG [C/A] ATTCACACGG 


11 


( 9) 


5 


( 4) 


Ala -> 


Glu 




.6 


2 B7 


V-OV_V-o/V«*jVjL. LA/ 1 J A A UALALAjLAj 


15 


(13 ) 


4 


( 3) 


lie - > 


Phe 




.17 


494 


TGTGAAGCTG [C/T] CCTCCGGCTC 


9 


( 8) 


2 


( 2) 


Pro - > 


Ser 




.26 


700 




24 


(18 ) 


2 


( 2 ) 


Silent 




1145 


. 18 


3 95 




2 1 


( 14 ) 


7 


( 7) 


Silent 






.20 


405 


CATCCG CAGG [G/T] TTCGGATGAG 


27 


(20) 


2 




Val -> 


Phe 


1146 


. 16 


276 




16 


(12 ) 


3 


( 3) 


Lys - > 


Asn 




.18 


285 


AGGCCCTGGC [C/A] AACGTCAACA 


13 


(10) 


5 




Silent 






.22 


340 


ACCTGCTCCA [G/C] CAGCTGGTGC 


16 


(12) 


3 


( 3) 


Ala -> 


Pro 




.23 


3 41 




15 


(12) 


3 


( 3) 


Ala - > 


Glu 




.25 


343 


TGCTCCAGCA [G/A] CTGGTGCTGC 


17 


(12) 


2 


( 2) 


Ala -> 


Thr 


1147 


.22 


324 


GAGACTGGCA [G/A] GCCTCGGCCT 


7 


( 5) 


3 


( 3) 


Arg - > 


Lys 


1146 


.29 


390 


TCGGTGACAT [C/T] GTCACAGTGG 


33 


(17) 


3 


( 2) 


Silent 




1149 


. 14 


174 


GAACCGGGGC [C/G] TGCGGCGGAA 


14 


(12) 


3 


( 2) 


Leu - > 


Val 




. 22 


414 


CGTAAAGCAT [G/T] GCCGGCCCGG 


23 


(20) 


4 


( 3) 


Ala -> 


Cys 


1150 


. 20 


257 


CTCAAAGACC [T/C] GGAAAAATGG 


42 


(19) 


2 


( 1) 


Leu - > 


Pro 




.34 


435 


CCTCATGGAC [T/A] AAAAAAAAAA 


7 


< 6) 


4 


( 3) 


3 ' UT 




1151 


.13 


312 


TCCAAAGCCC [T/C] GGTGGCCTAT 


33 


(16) 


6 


( 1) 


Leu - > 


Pro 




.14 


313 


CCAAAGCCCT [G/T] GTGGCCTATT 


33 


(16) 


6 


( 1) 


Silent 






.16 


346 


TGGATGAGGC [T/C] TCCAAGAAGG 


34 


(16) 


2 


( 1) 


Silent 






.22 


439 


AGTTTGGAGG [C/T] CCTGGTGCCC 


20 


(14) 


6 


( 4) 


Ala -> 


Val 




.25 


517 


TAATAAACAG [T/A] TTTTGAGGGA 


23 


(15) 


3 


( 1) 


3» UT 




1152 


.15 


131 


GCGCGTGTGC [G/A] AGGAGATCGC 


34 


(18) 


3 


( 2) 


Ser -> 


Lys 




.19 


160 


CCAGCAAAAA [G/C] CTCCGCAACA 


31 


(18) 


6 


( 4) 


Lys -> 


Asn 




.20 


161 


CAGCAAAAAG [C/G] TCCGCAACAA 


29 


(16) 


5 


( 3) 


Leu -> 


Val 




.24 


184 


TAGCAGGTTA [C/T] GTCACGCATC 


20 


( 9) 


22 


(15) 


Silent 






.31 


379 


CCAACCTTCA [G/A] GTCACTCAGC 


36 


(23) 


2 


( 2) 


Silent 





1154.8 119 GGGCACAGCC [C/T] TAAAGGCCAA 17 ( 9) 3(2) Silent 
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IP 






[polymorphism] (Lib) 




(Lib) 


Change 








39 


477 


TlfJTiUTi an r t / r* 1 T w rT , r , 'jv , T , is tv*^ 
lAuiAAlAAA[I/Uj 11 I LA 1 AlU C 


21 


(15) 


2 


2) 


3 ' UT 




1155 


6 


64 


TATTCTCCGA [G/C] CTTCGCAATG 


29 


(19) 


3 


3) 


5' UT 






7 


65 


ATTCTCCGAG [C/G] TTCGCAATGC 


25 


(17) 


3 


3) 


5' UT 




XJL3 / 






1\jWjU\GGAC IC/QJ GGTTCTCAGG 


18 


(11) 


3 < 


3) 


Silent 






12 


290 


GTCTGTCACA (A/GJ TCTGCTCCTT 


28 


(12) 


11 


7) 


3* UT 




1158 


4 


55 


CGAAAATTCG [G/A] CCAGGGTTCT 


36 


(20) 


2 < 


1) 


Ala -> 


Asp 


1137 


2 




AbCACCAGCG IG/TJ TGGCAGAGAC 


24 


(14) 


2 


1) 


Val -> 


Leu 




/ 




ACAGTGCAGG [G/A] CGGTATGCCG 


16 


(10) 


5 


3 ) 


Gly -> 


Glu 


lib 0 


10 


124 


TCAGGGAGCT [G/A] AATATTACGG 


28 


(18) 


2 


1) 


Glu -> 


Lys 




15 


166 


GTGGTGGTCG [G/A] AAAGCTATCA 


28 


(17) 


2 


2 ) 


Glu -> 


Lys 




l / 


229 


TCCAAGTCCG [C/G] CTAGTACGCG 


2 


( 2) 


29 


19) 


Pro -> 


Ala 


1161 


8 


263 


AAGGCAACGC [C/T] CTGCTGCGGC 


30 


(16) 


2 ( 


2) 


Silent 






9 


264 


AGGCAACGCC [C/T] TGCTGCGGCG 


22 


(14) 


9 ( 


9) 


Silent 






11 


283 


CGGCTGGTCC [G/C] ATTGGGGGTG 


13 


( 9) 


4 ( 


4) 


Arg -> 


Pro 


1163 


8 


1522 


GTACTTCCTC [G/T] TCCTCATGCC 


2 


( 2) 


5 ( 


1) 


Arg - > 


Leu 


1165 


1 


97 


CCACGACCGT [G/C) GCTATCTGGT 


3 


( 3) 


2 




Ala -> 


Arg 




4 


180 


GTGAGGGGCG [G/T] CCGCGGCGCA 


4 


( 3) 


4 ( 


2) 


Silent 




7 


273 


CCAAGGTGGG [C/A] ATCAAGACCA 


10 


( 7) 


4 ( 


3) 


Ala -> 


Glu 




8 


274 


CAAGGTGGGC [A/T] TCAAGACCAT 


20 


(12) 


3 | 


o \ 


He -> 


Phe 




13 


429 


AGCAGGAGCT [G/C] CTCATCAACA 


8 


( 7) 


5 


4) 


Silent 






14 


430 


GCAGGAGCTG [C/T] TCATCAACAT 


5 


( 5) 


8 


5) 


Leu -> 


Phe 




29 


901 


CCCCCAGAGG [G/A] AGGTCACCTG 


13 


(10) 


4 


3 ) 


3 ' UT 






3 5 


1007 


GCTTCCTCCT [G/T] GGCCCTCAAT 


6 


( 5) 


4 


4 ) 


3 ' UT 






3 8 


1189 


GATGTTTTGA [C/G] GAAATAAATT 


2 


( 2) 


7 i 




3* UT 




1170 


2 


410 


ATTGCGAATC [G/C] TTAGATATCC 


2 


( 2) 


2 ( 


2) 


Val -> 


Leu 


1171 


27 


2823 


AAGAGATGAA [A/T] AAAAAAAAAA 


8 


( 6) 


4 ( 


4) 


3 » UT 




1172 


15 


1519 


CTCTAGTGTT [G/C] AGGGATGTAG 


7 


( 7) 


2 ( 


1) 


3' UT 






19 


1784 


CAGGTCTTAA [T/C] GCCTCCATAC 


3 


( 3) 


2 { 


2) 


3' UT 






25 


2423 


GAGAGACTGG [T/A] GGGTCTGTCT 


7 


( 6) 


5 ( 


4) 


3' UT 




1173 


12 


A HI C\ 
1 f J U 


AGTAGGTAGG [G/T] CTAGTAGGTA 


6 


( 6) 


2 ( 


1) 


3 1 UT 






01 


981 


GCAGCCCCAG [T/C] GCACCTGAGC 


24 


18) 


48 (3 


0) 


Silent 






02 


1041 


ACATCAAGAG [A/GJ TACCTGGGCG 


71 (36) 


1 ( 


1) 


Silent 






03 


2400 


AGCTGAGTGC [C/T] GCCACCACCT 


71(36) 


1 { 


1) 


Silent 






04 




4 bp deletion 


















o ccn 

«3Q / 


CTAGATAGCA [A/GJ ATAGCTCTCA 


71(36) 


1 1 


1) 


3 'UT 






06 


2888 


CCCAAGCTGC [C/T] TCATGGCCCG 


63 (36) 


9 ( 


9) 


3 'UT 




1174 


24 


3200 


TGTTGACAGG [G/C] TTTTTAAGAA 


10 


( 8) 


2 ( 


2) 


3' UT 






27 


3302 


TCTGCCCAAGC [A/C] AAAAAAAAA 


5 


( 3) 


3 ( 


2) 


3' UT 




1176 


13 


2571 


GAGGCTTTGC [C/T] TTGCCTGCAT 


6 


( 4) 


3 ( 


3) 


3' UT 




1177 


18 


1684 


CTCTTCCCCC [T/C] AAAAATGGTA 


13 


(10) 


3 < 


3) 


3' UT 






21 


1664 


GTTAGCTTTA [A/G] AAAAAAAAAA 


5 


( 5) 


3 ( 


3) 


3' UT 




1181 


8 


678 


TACCAAAGCA [G/A] GGGTTCCCCA 


10 


( 7) 


2 ( 


2) 


Arg -> 


Lys 


1183 


18 


1719 


CTTCCTGCTC [G/A] ACTGAAAAAA 


14 


( 9) 


2 ( 


1) 


3 ' UT 






21 


1799 


TGGCTTTCAG [G/C] CCTGGCCTTT 


15 


(10) 


5 < 


4) 


3' UT 
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1184.14 2292 GCCTAAATGT [G/T] TGAAGTGCGA 30 (18) 2(2) 

1186.7 1337 GGGAGAGGTG [A/G] CCCTGAGGGA 2(1) 4(3) 

1188.7 1601 AGTCATCTGA [G/A] GTTATGCTTT 4(3) 2(1) 
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Target Loc'n 


Sequence around # Varia 


1 


# Varia 


2 Protein 




TTi 






fpolymorDhioa] fLihl 




IJjXO J 


Change 




1189 


.13 


1270 


CGGAAAGGAA [G/A] CGTTGGCAGC 


11 


( 9) 


3 


( 2) 


3' UT 




.16 


1341 


AGCCCCAGGG [ A/G ] CCAATTTTCC 


14 


I 1a 7 


2 


( 1) 


3 1 UT 


1190 


.5 


1010 


GGGGTTGGGC f G/T"! GGTTCCtttc, 




\ 2 ) 


3 


( 3) 


3 ' UT 


1193 


.1 


79 


CTCTCCCCTC fC/Gl AaTPfTiTrr 


5 


( 5) 


2 


( 2) 


5 1 UT 


1196 


. ^ J 


2123 


TATCFTTTrr fT/Pl ITTTiiTUPT 
1 I. 1 X IX/ I. J Al\;LAAiAtj 1 


19 


(14 ) 


2 


< 2) 


3 ' UT 


1198 


.29 


2395 


TGGCAAAGTC [T/C] GAAATAGGTC 


20 


(15) 


4 


( 2) 


3' UT 


1199 


.3 


1012 


AGATTCAGAA [ C/T] ATGGTGGGGA 


3 


( 2) 


2 


( 2) 


Silent 




.13 


1460 




6 


( 7) 


•5 


/ *> ^ 


3' UT 


1202 


, 7 


671 


AtLAlAAUl I IX/ C J TTTTTAAGGA 


13 


( 7) 


11 


( 6) 


3' UT 


1205 


.1 


942 


GGAGAAAATT [G/A] AAGAATATCT 


13 


( 6) 






Glu -> 


1206 


. 3 


740 


ACATCACAAA [A/G] CAACCTGTGG 


3 


( 3) 


2 


( 1) 


Silent 


1208 


.3 


1984 


TATTCCGTAC (G/A] TACAATGCCT 


2 


( 1) 


2 


( 2) 


Silent 




.15 


3163 


AATTTTTTTT [T/C] TTTTTAATTA 


2 


( 1) 


15 


( 6) 


3' UT 


1214 


.9 


1566 


GCATCCTGGA [C/T] AGCAACAAGA 


5 


( 3) 


2 


( 2) 


Silent 


1216 


.8 


202 


AGCGGAGCGC [C/G] TCCCGGGACA 


5 


( 4) 






Silent 


1217 


. 3 


2545 


GCCTCTCGGC [C/T] TTTCTCCACG 


5 


( 3) 


n 
Z 


t i \ 


Silent 




. 5 


2688 


GCCGTGTGCC [C/A] ATGCTACCCT 


12 


( 6) 


*a 
J 


f i \ 
\ J J 


3 ' UT 


1218 


.10 


2757 


GCAGGCTGCC [C/T] TTTAGAGAGG 


4 


( 2) 


2 


( 1) 


Silent 




.01 


1100 


GATGTCAGTG [G/C] CCCCATGCCC 


71(36) 




1 \ 1 ) 


Gly->Se 




.02 


1287 


GCCATGCACT [C/G] ACCAACGCCA 


71 (36) 






Silent 




. 03 


3385 


TTGCCTGGAC [G/A] TTGGCCTGCG 


71 (36) 






Silent 


1221 


. 20 


1893 


TGGAGCCTTC [G/T] GCTGGAAGTC 


9 


( 7) 


3 


( 2) 


3 ' UT 


1222 


.30 


2797 


CACAAACCCA [A/G] TTGTAAATAA 


14 


(11) 


2 


( 1) 


3' UT 


1223 


.3 


2813 


AAGCAGGAGG [C/T] TAAGAAAGTG 


13 


(10) 


2 


( 1) 


N/D 




.9 


3662 


GGACCGCAGT [C/T] CAGCATTTGT 


2 


( 2) 


2 


( 1) 


N/D 




.10 


3727 




4 


( 4) 


3 


( 2) 


N/D 




.15 


3855 


ACGTCCCAAC [G/A] AAGAGACCAC 


24 


(19) 


2 


( 2) 


N/D 




.16 


4110 




20 


(17) 


2 


( 2 ) 


N/D 




20 


4155 


CGACGTGGAT [C/T] CCATCGAGGT 


21 


(17) 


2 


( 2) 


N/D 


1224 


. X .5 


1 *71 Q 
J. / J J 


GCAGAGCCAC [C/A] AGGGAAAAGT 


2 


( 2) 


2 


( 2) 


3' UT 




17 


1936 


CCTCTTCTAA [T/C] CTCAAGGGTC 


3 


( 2) 


8 


( 7) 


3' UT 




21 


2061 


GCGAGTGAGT [G/T] GAGAGCCAGC 


15 


(11) 


17 


(13) 


3' UT 




22 


2079 


AGCTCTGCGG [A/G] GTCATCACGC 


15 


(11) 


17 


(13) 


3' UT 


1227 


9 


1107 


AGAAGGTGAA [C/A] CCCCTGGGGG 


9 


( 6) 


4 


( 3) 


Asn -> 




16 


1207 


TGGGAAGAGG [G/C] CATACGGAGT 


20 


(14) 


2 


( 2) 


Ala -> 


1229 


18 


1919 


ACTCCGTGCG [C/T] AATGCCGTCA 


4 


( 3) 


2 


( 1) 


Silent 


1235 


11 


1194 


TAGCCGCCAG [G/A] ATTGCCATGA 


18 


(12) 


2 


( 2) 


Asp -> 


1238 


14 


1133 


AGAACCTGAA [G/A] GCTGCGCAGG 


6 


( 4) 


2 


( 2) 


Silent 




17 


1298 


AACAACTCCA [G/A] GCCCTGCCCC 


8 


( 6) 


2 


( 1) 


3' UT 



1239.13 1289 ACTTTTCCTC [T/C] AATCCTGGAA 11 ( 5) 7(4) 3 ' UT 
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.14 1292 TTTCCTCTAA IT/C] CCTGGAAATT 16 ( 7) 2' (""2) 3 "~UT 
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Target Loc'n Sequence around # Varia 1 # Varia 2 Protein 

ID [polymorphism] ( Lib ) ( Lib ) Change 



1241.13 1802 AATTAAAGTTTTTCTTC [ C/T] ATG 10 ( 7) 2(2) 3 ' UT 



1242.18 3296 TCCTGTCACA [T/C] GTGCAGCAGG 13 (11) 2(2) 3 • UT 

.20 3328 AGCGGGCATC [G/T] CTGCCGCCAT 7(7) 3(3) 3 1 UT 



1243.5 134 GAACGCAGTG [G/A] ATGCCTTTCG 4(4) 3(3) Asp -> Asn 

.6 184 TGCGCAGCCC[C/G]GTCACCAACA 7(7) 3(2) Silent 

.7 185 GCGCAGCCCC [G/T] TCACCAACAT 7(7) 4(2) Val -> Phe 

.24 1528 CGGTGGAGCA [G/A] CCCCTGGGCT . 10 ( 8) 3(2) 3 1 UT 

.31 1789 TACACGTGTT [G/A] CTTCGTCCAG 14 ( 9) 2(2) 3 • UT 

.32 1790 ACACGTGTTG [C/A] TTCGTCCAGT 16 ( 9) 8(7) 3 ' UT 



1246.6 1512 ATCCCGGAGG [G/T] TCACTCTGAA 2(2) 2(1) Val -> Phe 

.9 1958 ACGTTTTAAC [A/G] TAGTAAATCC 3(3) 6(6) 3 ' UT 



1247.6 517 GCGGACAGTA [C/T] ATTGCCATTG 2(2) 2(2) Silent 



1248.4 164 TGATGTCCCC [C/T] TTCGACCCGT 4(3) 2(2) Silent 

.5 172 CCCTTCGACC [C/A] GTCGGGTTTG 2(1) 3(3) Pro -> Gin 

.11 815 AGCACAGCCC [C/T] TCTACCAGGG 13 ( 7) 2(2) Silent 



1249.1 50 ACCGCCTGCG [G/A] AGTAACTGCA 4(3) 2(2) 5 ' UT 

.26 1800 TTGTAAAAGG [G/T] TTACTCTCAT 26 (16) 2(1) 3 ' UT 



1250.1 353 GCCCCGCCAG [G/A] ATTAACACAG 3(2) 2(2) Silent 



1251.11 1070 CCGCCAACGG[C/A]AACATCGACC 2(1) 4(2) Ala -> Glu 

.18 1974 CTGGGAAATG [C/A] GGGACTGGAA 2(1) 2(2) 3 ' UT 



1253.7 673 GCCAGGTGGT [G/C] CAGATCCCTG 2(2) 2(1) Silent 

.11 1620 GCCTATGTCG [G/A] CGACGTCCAC 2(2) 2(1) Ala -> Asp 

.13 1672 ACACCAAGAC [ C / T ) ATGG AG CTG C 2(2) 2(1) Silent 

.16 3427 TCGACCACGC [G/A] GAGCGGGAGC 2(2) 2(1) Silent 

.21 3 84 8 GACCCCGCTG[C/T]CACCCGCTTT 2(2) 2(1) 3 1 UT 



1255.11 895 TCAAATGAAT [C/G] AACCACCTGG 2(2) 2(1) Gin -> Glu 

.23 1729 TCATTTTTCT [A/G] TATAGGCTGC 2(2) 17 ( 8) 3 ' UT 

.24 1731 ATTTTTCTAT { A/G] TAGGCTGCAC 2(2) 17 ( 8) 3 • UT 

.27 18 01 TTTCCAATAAAATC {G/A] GAATTC 3(2) 3(3) 3 ' UT 



1257.11 674 AACAAGAACA [ C / T ] ATGATAAATT 9(6) 2(1) Silent 

.19 954 GTGAGAGAAC [G/C] AAATCTCTAT 21 (14) 3(2) 3 ' UT 

.20 955 TGAGAGAACG [A/C] AATCTCTATC 19 (14) 3(2) 3 1 UT 



1258.11 329 ATCACAGCAA [A/G] AGAGAGGTTC 22 ( 9) 4(1) Lys -> Arg 

.15 357 TCACTACCAA (C/T) CTGATCAATT 24 (10) 6 ( 3) Silent 

.17 422 TCTGCCTTTT [C/T] TACCATGATG 25 (11) 2 ( 1) Ser -> Phe 

.20 533 AGCTTCCTAA [G/A] TCAAGGCCAA 27 (13) 2 ( 1) Ser -> Asn 

.32 745 G CTTCCAGAA [ C / G ] AGATCAAAAA 17 (10) 2(1) 3 ' UT 



1261.6 425 CTGG CATCAT [ C / T ] G CCATCTACG 9(3) 2(1) Silent 

.20 908 CGCCCCTCCA[G/A]GCCCCCGGCG 8(3) 3(3) 3 ' UT 



1265.1 46 ACTCGAGCCT [G/A] CTGTTCACCG 3(2) 2(1) 5 ' UT 

.19 1023 GGAGGGGGCA [A/G] ATGGTGGTTG 2(1) 20 ( 7) 3 ' UT 



1266.1 343 CGCTGCGGAC [G/A] AAAAGGCCAA 2(2) 3(2) Glu -> Lys 

.7 661 AGCAGGTGAA [G/A] GGCATCGCTG 7(6) 4(3) 3 ■ UT 

.9 671 GGGCATCGCT [G/T] CCCCAGGCCT 10 ( 9) 4(3) 3 » UT 

.16 865 GTAGAGCACA [G/A] GGGTTTCCCC 25 (12) 2(2) 3 ' UT 
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Target 
ID 


Loc'n 


Sequence around # Varia 
[polymorphism] (Lib) 


1 


# Varia 
(Lib) 


2 Protein 
Change 




1267 


.11 


1776 


GGCTAGAGGA [T/C] GCACGGTGGC 


2 


( 2) 


7 


C 5) 


3 ' UT 


1268 


.10 


6529 


TTCATCCTCA [C/T] TCCCCACATC 


10 


< 6) 


2 


{ 2) 


Thr -> 


1269 


.19 
.20 


1893 
1941 


CAACTTCAAC [C/G] TGGAGGTGCA 
TAAAAAGGTG [A/G] CTGTTTTATA 


12 
12 


( 4) 
( 4) 


3 
4 


( 3) 
( 4) 


3 ' UT 
3 * UT 


1270 


.11 


331 


TTGTCCTCAG [T/C] ACCTCTCCGT 


11 


( 9) 


2 


( 2) 


5' UT 


1271 


.14 


949 


GGGTGTATTA [T/C] CCAGGTACTC 


18 


(11) 


5 


( 1) 


3 ' UT 


1272 


.10 


2678 


TGTTAAGGAA [C/Tj GCTAGCAGGG 


3 


( 1) 


3 


( 1) 


3 » UT 


1273 


13 


3127 


AAAGGAAGTT [T/C] TCCTTTTGAA 


7 


( 2) 


10 


{ 3) 


3' UT 


1274 


16 


2696 


ATATTTTTTC [A/G] TAATCTATAT 


7 


( 6) 


3 


( 2) 


3 ' UT 


1278 


7 

32 
33 
34 


864 
3897 
3898 
4013 


AGTGTGACCC [G/A] GACTGCCTCC 
CCAGAACACG [G/C] CTCACGCTTA 
CAGAACACGG [C/G) TCACGCTTAC 
TGTTGTGTGT [A/G] TCGAGAGGCC 


3 
5 
4 
10 


( 1) 
( 3) 
( 3) 
( 7) 


2 
3 
4 
3 


( 2) 
( 3) 
( 4) 
< 2) 


Silent 
3' UT 
3' UT 
3 • UT 



He 



.5 164 8 TTAAGAGGAC [G/A] TAATGGGGTC 14 ( 8) 

.15 1957 TAAAGATGATTGTGG [G/A] AATTC 2(2) 



1282.1 


2155 


TTTGGTGGGC [C/T] TACTTGGTGC 


7 


( 3) 


.2 


2283 


GTGTGGCGTA [G/C] GCAGTGGGTC 


13 


< 1) 


.9 


2799 


TTACATCACC [G/A) CCACTACTGC 


6 


( 3) 


.10 


2824 


CAGTGCCCAG [T/C] GGCCGCATGC 


4 


( 1) 


.15 


2937 


TGGTTTTGTT [G/C] CCTGACACAG 


11 


( 4) 


1284.1 


249 


CTGTCGACGA [T/C] CCCTACGCCA 


7 


( 7) 


.6 


522 


GGGGCAGTGC [G/C] GTCATCTCCC 


5 


( 1) 


.7 


523 


GGGCAGTGCG [G/T] TCATCTCCCT 


7 


( 4) 


.10 


608 


GCCCTTGGGG [G/T] TTGCAGGCTG 


B 


C 7) 


.20 


651 


GGGCTGGGGG [G/A] ATCCCAGCAG 


6 


( 8) 



6 


{ 


1) 


3 ' 


UT 


2 


( 


2) 


3 ' 


UT 


2 


( 


2) 


3 ' 


UT 


3 


( 


3) 


3 ' 


UT 


3 


( 


1) 


3 ' 


UT 


4 


( 


3) 


Silent 


5 


( 


4) 


Silent 


4 


( 


1) 


Val 


-> Phe 


2 


( 


1) 


3 • 


UT 


2 


( 


2) 


3 • 


UT 



1286 


.20 


5366 


GGCCATTGCC [G/A] CAGTCGCAGC 


12 


(11) 


2 


( 2) 


3' UT 


1287 


.10 


864 


AGGGATGTTAGACGGAATT [C/G] C 


2 


( 2) 


4 


( 3) 


3' UT 


1289 


.15 
.22 


885 
1006 


ATCATGTGGA [G/A] GGGCCAGAGG 
GGCATTCCAG [C/G] TGAGACACTG 


13 
21 


( 9) 
(10) 


2 
5 


( 1) 
( 2) 


3' UT 
3 ' UT 


1290 


.7 


929 


CCCTCACCCC [A/G] TCACGCCTCG 


3 


( 1) 


2 


{ 2) 


3' UT 


1291 


.5 
.8 
.12 
.13 


1060 
2166 
4517 
5114 


TCAACAAAAA [G/A] GGACAGGTAC 
TAAGTACCAC [G/A] AGCAGCTGGG 
GCTGACAGAG [G/A] AGGAGGACTA 
CCAGCCTCCA [G/A] TGTACAACTT 


2 
2 
5 
4 


( 1) 
( 1) 
( 2) 
( 1) 


2 
2 
2 
2 


( 1) 
( 1) 
( 1) 
( 1) 


Silent 
Ser -> 
Ser -> 
3' UT 


1292 


11 
20 
21 


3547 
3888 
3689 


AGGCAAATTC [A/G] ATTTGAACAT 
TGTGTGTGTG [T/G] GCTGTCGCTT 
GTGTGTGTGT [G/T] CTGTCGCTTG 


7 
11 
11 


( 3) 
( 9) 
( 9) 


5 
3 
4 


( 3) 
( 3) 
( 3) 


3' UT 
3' UT 
3' UT 


1293 


10 
11 


2480 
2481 


CATGCCTGTG [C/G] GTGCGCTTCC 
ATGCCTGTGC [G/C] TGCGCTTCCT 


2 
4 


( 2) 
( 4) 


3 
2 


( 2) 
( 1) 


3' UT 
3* UT 


1298 


20 


960 


TTCAG TGGGC [T/C] TTTCTGGCAG 


12 


( 8) 


2 


( 1) 


Leu -> 


1300 . 


7 


566 


AAGTGTACCT [T/G] GAATTCTTTG 


2 


( 2) 


4 


( 2) 


N/D 



Lys 
Lys 



Pro 



WO 98/41648 



191 / 214 



PCT/US98/05419 



Target Loc'n Sequence around # Varia 1 # Varia 2 Protein 

ID [polymorphism] (Llb , {Llb) change 

1301.12 668 CGCCCGGCTG [O/C] GCAAGGAGAT 9 ( 5) ""3" Ala"> Ara 

.30 1058 CAAGGTCTAT [G/C] CTGACGCCTC 16 ( 7) 3(2) Ala -> Pro 

.31 1059 AAGGTCTATG IC/G] TGACGCCTCC 13 ( 6) 3(2) Ala -> Val 



1302.7 759 ACAGGCCACA[T/G] CTGGACCATC 2(2) 5(5) Ser -> Ala 

.8 806 TATCAACTCC [C/T] GGACAACCCA 2 { 2) 4(4) Silent 

.10 866 TTCGAAGAGT [T/C] ATTGCCAAGA 4(4) 2(2) Silent 

.17 2000 GAATTTAATA [ G / T ] GTACAGAAG T 5(5) 4(4) 3 • UT 

.19 2158 ACTTCTAAAG [C/A] AAGAGGATAA 8(7) 9(9) 3 ' UT 

1303.5 1226 TGCTGTGCAC [A/G] TTGACTACAA 6(5) 2 "2) Ue"> Val 

.15 1624 GATTATATAT [T/A] TTTTTTTCTG 7(5) 3(3) 3 ' UT 

.21 1813 GTGCACTAAT [A/G] TGTAAGACAA 9(6) 3(3) 3 . m 

•22 1920 TTAAATAGCT [C/T] TTTTCTCTGA 2(1) 14 ( 8) 3 • UT 

.23 2079 TCTATAAACC [A/G] AACTGATGTA 2(1) 16 ( 9) 3 ■ ur 

1305.12 1434 AATAAACTATAGTAGTGTT [T/A] T 8 ( 8) "l"('V) 3~~UT~ 

1306.14 407 TTTGATATTG [C/T] CTCTGGAACT 2 ( 2) '"VW) Ala"-> Val 

• 21 1Q 21 TTTTTTTGCA [A/T] AAAACTAAAT 2(2) 4(3) 3 ' UT 



1309 


.4 
.5 


466 
494 


GCGGGCCGCC [T/C] GCTCTTGGAG 
AGGAGTATGC [G/A] GCTCGGGCCC 


5 
4 


( 5) 
( 3) 


2 
3 


( 1) 
( 3) 


Leu -> 
Silent 


Pro 


1312 


.10 


492 


ACCCCTGGGG [G/A] AGTGCATCAT 


7 


( 6) 


3 


( 3) 


Ser -> 


Lys 


1315 


13 
22 


339 
766 


AAGTTCCTCA [C/A] GCCCTGCTAT 
TCCTTTTTTA (A/G] AAAAAAAAAA 


13 
8 


(10) 
< 7) 


2 
3 


( 2) 
( 3) 


Thr -> 
3' UT 


Lys 


1317 


4 


1083 


GATAGATTAT [G/A] TATTCTTCCA 


3 


( 3) 


4 


( 3) 


N/D 




1318, 


2 


183 


GGGAGCCTGC [C/A] AGGGTCCGCT 


12 


(11) 


3 


( 3) 


Silent 





876 TGACTCCACA [G/A] CCTCAGCCGA 23 (14) 



1326.5 


139 


GGCCTGGAAA [C/T] TTGCACAGTC 


5 


( 


5) 


.12 


1339 


TAGGAAAGAC [G/A] TCGGCTTTCG 


5 


( 


2) 


.17 


2214 


TCCCCAGGGT [T/C] TTCTCATGGT 


2 


( 


2) 


.19 


2333 


ATTCTGAGGG [A/G] TATCCAGCAG 


4 


( 


4) 


1328.5 


2968 


CCTAAAAGTG [T/G] TTTTTATTTC 


6 


( 


4) 



3 ( 1) Leu -> Phe 

3 ( 3) Val -> He 

5 ( 3) Silent 

4(2) Asp -> Val 

4(4) 3' UT 



1330 


.13 


1526 


TTGATCATGA [O/A] ACATAGGTAT 


6 


( 


3) 


2 


( 1) 


3* UT 


1331 


.15 


1666 


ACAAGCACAC [C/G] TTAGAGGCTT 


2 


( 


2) 


10 


( 4) 


3' UT 




.24 


2009 


CTGCTGATGC [C/T] GTACCCTCAC 


13 


( 


7) 


2 


( 2) 


3' UT 


1332 


.5 


618 


AGCTGAACCC [G/C] GAGTCCTCCC 


2 


( 


1) 


2 


( 1) 


Silent 


1333 


.4 


89 


GAGCACAGCG [G/A] CATCTTTGGC 


7 


( 


5) 


2 


( 2) 


Ala -> 




.10 


279 


CCGTGCAGGC [C/A] ATGAACCGCA 


5 


( 


5) 


6 


( 5) 


Silent 




.24 


756 


TGACCCCCGA [C/A] CCAGCCTCGC 


6 


( 


6) 


7 


( 6) 


3 » UT 


1335 


.1 


331 


AGGGCTGGCC [C/T] TTGGAAGGCG 


4 


( 


4) 


2 


( 2) 


5' UT 




.13 


872 


AGCCAAGCCG [Q/T] TCAAGGCATC 


7 


( 


6) 


2 


( 1) 


Val -> 




.28 


2268 


GGAAAAGGGA [G/A] AAACTGAGCG 


6 


( 


6) 


2 


( 2) 


3' UT 


1336 


.6 


851 


GCCGCGAGGC [C/G] TGGTCTGAGC 


5 


( 


5) 


11 


t 5) 


3' UT 




.7 


889 


GGTCCTCTCA [G/A] TCTTTCCCCT 


21 


(10) 


2 


2) 


3' UT 




.15 


990 


TTGGCAACGG [C/T] CGTCGTCATG 


17 


(11) 


2 


1) 


3' UT 



Asp 
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Targat Loc 

ID 


'a Saquance around # Varia 1 
[polymorphism] (Lib) 


# Varia 2 

(Lib) 


Protein 
Change 


1337.12 


420 GCAGTCATGC [C/G] GGGTGATCGT 32 


(15) 


3(2) 3' UT 



133 9.17 2972 TATTAGTCCA [A/G] TGAGATTTCC 
.20 3146 GTCGGACAGT [G/T] GCTCATAGAG 



1341.3 630 CTCGTAAGGC [G/T] TCCGGTCCCC 4(4) 6(3) Silent 

.4 633 GTAAGGCGTC [C/T] GGTCCCCCGG 10 ( 9) 4 ( 2 ) Silent 

•17 B96 AAAAAGGCGG [G/C] CGGAACCAAA 22 (14) 2 ( 1) Silent 

.29 1107 AGGCTGTGAA [G/A] CCCAAGGCCG 13 ( 8) 2(1) Silent 

.32 1195 AAACCCAAAA [G/A] GCTCTTTTCA 7(5) 5 ( 3) 3 . ur 

1342.5 142 GCGCCAAAGC [G/A] AAATCCCGCT 11 ( 9) YYY) Sillnfc 

.7 227 CGCAGAGCGG [G/T] TTGGGGCAGG 4(4) 5/4) Val => Ph. 
.8 271 TGTTAGAGTA[C/T] CTGACCGCCG 11 (11, 4 < 2) Silent * 
314 ^ CGCGGCTCGCEG/AJACAACAAGAA 8(8) 2 ( 2) A^p > Asn 

1343.17 514 GAACTCAAAA [G/A] GCTCTTTTCA 7""?) YYY) 3'"ut" 

1344.2 149 GAGCGCATCG [C/G] GGGAGAGGCT 2 ("2) YYYY'"YYa~Y > Gly 

1345.3 360 GGCGCGGTGG [G/C] GTCAAGCGCA 3 ( 3) 3T1)" ~Gly"> Ala 

1346.1 2269 CAGACTGGTG I A/0) ACGAATATTC 2 ( 2) 2" (2) Asn - > Aso 

.2 2407 CTCTGAGACG [A/C] TGAAGACCCG 2(2) 3(3) Met -> Leu 

.10 3265 TGCCGGGCCT[C/T]CCTCCCGGGG 3(3) 2(2) 3 ' UT 

1347.3 107 GAAGCCGAGA [C/G] GGAAAATGTC 12 ( 8) V"l] Arq"-> Gly 
.5 109 AGCCGAGACG [G/A] AAAATGTCAT 2(2) 3(3) Silent 

.6 111 C CGAGACGGA [ A / G ] AATG TCAT CA 16 (12) 2(1) Lvs > Arc 

-37 994 GGTTCTTGTT [T/G] GGGCACAGCA 16 (11) 3 ( 3) 3, ^ 

.38 996 TTCTTGTTTG [G/T] GCACAGCACA 17 (11) 4(4) 3 ' UT 

1349.4 351 ATCGGGATCG [T/A] GTGTTCCAGT 4 ( 1) Y"Y) Val"> Ser 
.9 113 6 GCCCTGCACG [A/G J GCCCAGGGGC 19(13) 3(3) 3 • UT 

.10 1137 CCCTGCACGA [G/A] CCCAGGGGCT 10 ( 6) 11 ( 7) 3 • UT 

.11 1150 CAGGGGCTGA [G/A] CGTTCCTAGG 20 (12) 2(2) 3 » UT 



1350 


.4 
.5 
.10 
.12 


188 

275 
473 
770 


CCAAGCGCTC [T/C] AGGGGCTTTG 
ATGGAAGAGT [T/C] GTGGAACCAA 
GGGGCTTTGC [C/T] TTTGTAACCT 
ATGGATTTGG [C/T] AATGATGGAA 


4 

15 
9 
5 


( 4) 
(10) 
( 8) 
( 5) 


12 
2 
3 
2 


( 
( 
( 
( 


7) 
1) 
2) 
2) 


Silent 
Silent 
Silent 
Ala -> 


1351 


.25 


1695 


GTGTGGAGAA [G/A] CCACAGGCCT 


10 


( 7) 


10 


( 


8) 


3' UT 


1354 


.23 


2233 


CAACAATTTT [C/T] TATGTTAGTT 


7 


( 6) 


3 


( 


1) 


3* UT 


1355 


.7 
.8 
.10 


4296 
4778 
4785 


AGCCTTCAGG (C/T] TCGGGGGGCT 
GCGCTGATAA [C/G] GTTCATGGAA 
TAACGTTCAT [G/A] GAACGCGTTG 


2 
3 
5 


( 2) 
( 3) 
( 5) 


2 
3 
2 


( 
( 
( 


1) 
3) 
1) 


Ala -> 
3' UT 
3 • UT 


1358 


.8 
.17 


2515 
2629 


CAGGGCGAGT [G/C] GCATGTCTGC 
CTTGGCATGT [G/A] ATGGCAGCTC 


7 
20 


( 7) 
(17) 


2 
2 


( 
( 


2) 
2) 


3' UT 
3' UT 


1359 


.3 


297 


ATAAATACAA [G/A] AACATTGGAG 


3 


( 2) 


2 


( 


2) 


Silent 


1360 


12 


54 8 


TGTAAGCTGA [G/C] CCTGGTGGCC 


8 


( 6) 


2 


( 


1) 


3 • UT 


1361 


10 


4077 


CTGTCTTTCC [A/G] TTTTTTCATG 


14 


( 9) 


2 


( 


1) 


3' UT 


1362. 


9 


1832 


CCGCCAGGCG [G/A] ATTTTGTTCA 


2 


( 2) 


2 


( 


2) 


Silent 



Val 



Val 
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Target 


Loc'n 


Sequence around # Varia 


1 


# Varia 


2 Protain 






ID 




[ polymorphism] ftifei 






Change 






.11 


2248 


CCTATCGGCT [C/G] TTTGCAGTGG 


3 


( 9 } 


3 


( 3} 


Leu -> 


1363.22 


2874 


CCGGAATCCA [A/C] AGTGCTCTGC 




\ * ) 


7 


( 5) 


3 


UT 


1366.3 


615 


CGCCCATGGC (G/Al ACCAGTACAA 


n 

1 




2 


( 2) 


Asp -> 


.6 


722 


TGTACAACTT [T/C] CCCGCAGGCG 


2 


( 2) 


8 


( 7) 


Silent 


1367.18 


1851 


AAAAAGTAATTCCTTAAA [C/A] AT 


4 


( 4) 


4 


( 3) 


3' 


UT 


1368.5 


2964 


TCTGAGACAC [G/A] CCCCAACATG 


3 


( 3) 


2 


( 2) 


3' 


UT 


1372.1 


276 


AGATGCTAAG [A/0] TTACCTTTCC 


4 


( 3) 


2 


f 2 ) 


He -> 


1373.13 


3855 


AATATAATAT [C/T] GACACAGTGC 


4 


{ 4) 


2 




3' 


UT 


1378.12 


4157 


TGCTGGGGCA [T/C] GGCGGGATCC 


2 


( 2) 


2 


( 1) 


3' 


UT 


1383 .14 


1832 


ATCACCACCA [C/T] GTGAGTGGTA 


12 


{ 6) 


4 


( 3) 


Silent 


1385.17 


3454 


CAGTGCTAAT [G/A] TGTGCAAGCA 


7 


( 5) 


4 


( 3) 


3' 


UT 


1386.31 


470 


GGGTGACGGG [C/G] CCATGGGGCG 


5 


( 5) 




\ J J 


3' 


UT 


1387.5 


1385 


TCGGTGCAGT [T/C] TCCACTCTTG 


2 


( 2) 


2 


( 2) 


3 1 


UT 


.7 


1678 


CAGGCTCATC [C/A] TGGGAGCTTT 


3 


{ 3} 


5 


( 3) 


3' 


UT 


.8 


1900 


CAGCCCTGCT [G/A] ACCATCTCAC 


4 


( 4) 


2 


( 2) 


3' 


UT 


.11 


1967 


GCCCCCTGGG [G/A] AGTTGGGGAA 


17 


(13) 


2 


( 2) 


3 » 


UT 


. 15 


2 075 


ni lit,! rLLT LQ/T J GTGGCATTAG 


18 


(14) 


3 




3 ' 


UT 


.17 


2089 


GGATTAGCCA [C/T] TCCCTGCCTC 


22 


(15) 


2 




3 ' 


UT 


. 22 


2234 


AA«AwA(jAvjAGA IA/GJ AAAAAAAA 


13 


(10) 


6 


( 4) 


3 * 


UT 


1388.17 


2799 


CACAGAAGCA [G/C] CTAAACCAAG 


15 


(11) 


4 


\ -l J 


3 ■ 


UT 


13 95 . 4 


327 


v-AATGTGTTA [ T/C J GTAGTGCTTA 


35 


(17) 


2 


{ 1) 


3 ' 


UT 


1396 .10 


1887 


GGCACGAGCC [C/T] TCCTTCTATA 


3 


( 3) 


3 


( 1) 


3 ' 


UT 


.12 


1921 


CCCCAGTGGG [G/A] ACTGAGTTAT 


3 


( 3) 


5 


( 2) 


3 1 


UT 


.21 


2403 


TGACCAGGAC [G/C] CCTCTGGCCC 


2 


( 2) 


3 


( 3) 


3 1 


UT 


.26 


2579 


AAAGGCTGAA [T/A] TGTCTGAAAA 


10 


( 7) 


3 


( 1) 


3 1 


UT 


1397.23 


6232 


TATTCAGAGT [G/T] GGCTGGGCCC 


3 


( 3) 


2 


( 2) 


3 ' 


UT 


1399.2 


177 


CCCCCGAGGG [G/A] ATGCCAAGAT 


3 


( 3) 


2 


( 2) 


Asp -> 


. 10 


1136 




3 


( 3 ) 


4 


< 4) 


Silent 


.16 


1279 


CTGCTGTAAA [G/A] GCTGCAGCCT 


8 


( 8) 


2 


( 2) 


3 ' 


UT 


1401.3 


71 


CCAAGAATCT [G/A] CTGCGCATGA 


2 


( 2} 


3 


( 3) 


Silent 


.17 


874 


TTATGTTTAT [G/A] TTTATTATGT 


8 


( 6) 


6 


( 4) 


3 * 


UT 


.19 


917 


TTGGAATCAA [G/A] TGTCATAAGA 


8 


( 7) 


5 


( 4) 


3 1 


UT 


.21 


1081 


TCTACTTTCA [A/C] AAAAAAAAAA 


2 


( 2) 


7 


( 6) 


3' 


UT 


.23 


1083 


TACTTTCAAA [A/T] AAAAAAAAAA 


2 


( 2) 


3 


( 3) 


3' 


UT 


1404 .12 


3921 


TGTTGCACAC [T/C] AGCCTTACAG 


3 


( 3) 


2 


( 2) 


3 • 


UT 


1405.15 


4823 


GTCCACATGC [A/G] CTGGGCGTCT 


4 


( 4) 


12 


(10) 


3 • 


UT 


1406.5 


4618 


TGCTTTCTAG [G/C] TCAGTCCCTG 


5 


( 3) 


6 


( 4) 


3 ' 


UT 



Val 



Asn 



Val 



Asn 



1407.5 40S CCCAGGGGGG I G/C] AGCTCCCATT 5(4) 2(2) Ser -> Gin 

.9 713 TCTCTCAGAG [G/A] AAGTTTTTGG 10 ( 7) 2(1) Silent 
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Target Loc'n Sequence around 



# Varia 1 # Varia 2 Protein 



ID 






[polymorphism] (Lib) 




t T A VI 
VJjIDJ 


Change 








.18 


1053 


GGGCAGGGAA [T/C] CCTGGAGCAC 


21 


(13) 


2 


( 2) 


3 ' UT 






.21 


1144 


GTGGGGTGGG [G/Aj TGAGTAGGAC 


2 


( 2) 


25 


(14) 


3' UT 




1411 


. 4 


2009 


GGCGTCAGAG fA/Gl Tfinvw^Tfia. 


6 


( 4 ) 


7 


( 5) 


3* UT 




1414 


.13 


930 


ACATACGAAC [C/T] GCCTCCTTCC 


16 


(13) 


3 


( 2) 


3' UT 




1415 


.24 


1362 


GTGCGATTCT [A/G ] GATAAAGCCA 


7 


( 5) 


3 


( 3) 


N/D 






.26 


1442 


GAGAATCCCT [G/A] GCAAAGGGAG 


10 


( 8) 


3 


( 3 ) 


N/D 




1420 


. 6 


461 


uwwvjuyrtuu IM/ 1 J 1 vj>v\L«AAr\oA 


2 


( 2) 


2 


( 2) 


Arg -> 


Leu 




.8 


685 


TGGTGGCAGT [G/T] TGGGCTCTCA 


12 


( 8) 


2 


( 1) 


Val -> 


Leu 




.9 


689 


GGCAGTGTGG (G/CJ CTCTCAGCCA 


15 


(12) 


2 


( 2) 


Silent 




.16 


853 


GCTGGCAGCT (G/T] TGAGGCTCTA 


25 


(19) 


2 




Val -> 


Leu 


1421 


. 8 


169 


1 A I nUlu I A/ U j AUfViATTACA 


20 


(14) 


2 


( 1) 


Silent 






.25 


1166 


GTTAGTTTTC (T/C] GGCCCGTGGC 


4 


( 3) 


3 


I 9 \ 


3 ' UT 






.26 


1167 


TTAGTTTTCT [G/T] GCCCGTGGCC 


4 


( 3) 


±± 


\ f ) 


3' UT 






.29 


1275 


i lhjuuhial lL./t#J liATAGGCTTA 


6 


( 5) 


14 


111) 


3' UT 




1422 


. 7 


278 


uvahjuaacuvj l(*/CJ UCACCATCAA 


4 


( 3) 


•a 


( 1 \ 


Ala -> 


Pro 


1424 


.3 


1012 


WaWVbWiiij L. IT/ GJ CTCTCTCGCG 


2 


( 2) 




V J J 


Silent 






.4 


1021 


CTCTCTCTCG [C/T] GTAGCTGGAA 


5 


( 3) 




f 1 ^ 
I a; 


Silent 






.7 


1295 


t» 1 1 1AATGCA LT/ A J GGATTCGAAA 


2 


( 2) 


3 


( 2) 


Trp -> 


Arg 


1425 


.3 


274 


GCACTGGAGG [G/T] TTTAATTTTG 


2 


( 2) 


2 


( 2) 


Gly -> 


Val 


1426 


.2 


1364 


GATCACCAGA [T/C] ACCAGGGTGT 


9 


( 6) 


2 




Tyr - > 


His 




.17 


2298 


TCTCCAGAGT [C/T] ACTCCGTTCT 


4 


( 4) 


3 


( 3) 


Ser -> 


Leu 


1427 


.3 


90 


CGCCGGCTGC [G/CJ CTGCAGGTGA 


8 


( 6) 


3 


\ i. ) 


Silent 






.4 


91 


GCCGGCTGCG [C/G] TGCAGGTGAC 


8 


( 6) 


3 


( 1) 


Leu -> 


Val 




.6 


109 


uauACjTTCGT IG/AJ ATGCTATAAA 


12 


( 6) 


2 


( 2) 


Asp -> 


Asn 




.11 


438 


TCTTCAGGGG [A/G] CCCAATGGTG 


7 


( 2) 


2 




Glu -> 


Gly 


.23 


1172 


CTATTCATAA [A/C] GGAAAACGAT 


10 


( 5) 


12 


( 7) 


3' UT 


.24 


1179 


i/wuttioiww LU/TJ VjATTTCTAAA 


21 


C10) 


2 


( 2) 


3 ' UT 






31 


1323 


i-nftHiiAiAI lw AJ AuAl 11 1ATC 


8 


( 3) 


13 


(10) 


3' UT 






34 


1376 


vjuavxhvj iV,v-l lU/uJA ioAAAGATG 


13 


( 7) 


5 


( 4) 


3 ' UT 






37 


1433 


yuiiHiAAiA lw TJ AUATTTACTG 


6 


£ 2) 


9 


( 7) 


3 1 UT 




1430 


3 


682 


TCTTTfWVJA ff3/ll TPar'aTPfi./T' 

* ^ * * \J\J\JVJ#% l V»/ J\] 1 L-rUjAl ulAuV^ 


7 


( 6) 


2 


( 2) 


Ser -> 


Glu 


1431 


2 


79 




7 


( 6 ) 


2 


£ 2) 


Silent 






6 


296 


TCACGCAGTG FG/n r , r , AnTAATr , a. 


10 


( 7) 


7 


( 6} 


Ala -> 


Pro 


1432 


8 


2640 


AAGTTGCTTA [0/A] AGAGCCACCA 


8 


( 7) 


2 


{ 1) 


3 * UT 






9 


2695 


GTTTTAATGC [A/C] AAGGAAATTT 


12 


( 9) 


3 


( 3) 


3 ' UT 




1433. 


7 


1695 


AGCCGGGCTG [C/T] TACCTGCCCA 


3 


< 3) 


2 


( 2) 


Silent 






10 


2052 


CCCCTGGGTG [C/T] GGGGTGATCG 


2 


( 2) 


2 


( 2) 


Silent 






11 


2160 


ATGAGTCCAC [T/C] CTGGCCTTCC 


2 


( 2) 


2 


( 2) 


Silent 






23 


2698 


GGACCTTCGA [G/A] GGCCTCTGCC 


4 


( 4) 


3 


( 3) 


3 ' UT 






28 


2787 


GTGGAGGAGA (G/A) GCCTGTGGCC 


6 


( 6) 


2 


( 2) 


3' UT 






30 


2844 


GGTGGCGCAG [C/G] CTTGGTAACG 


15 


(13) 


8 


( 6) 


3 ' UT 






31 


2848 


GCGCAGCCTT [G/A] GTAACGCCAT 


15 


(13) 


8 


( 6) 


3' UT 






32 


2857 


TGGTAACGCC [A/G] TGGACTGCAG 


16 


(14) 


8 


( 6) 


3' UT 






33 


2877 


GCGACAATCA [A/G] TGGATGGTGC 


16 


(14) 


8 


( 6) 


3 ' UT 






34 


2942 


CCCTACCTGT [C/T] TTATTTCATA 


17 


(14) 


14 


( 9) 


3' UT 





1434.15 2041 ACTGTACCTT [C/T] TATGGTTTGC 2(1) 



5(4) 3' UT 
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Target 

ID 


Loc'n 


Sequence around - # Varia 1 
[polymorphism] (I*ib) 


# Varia 

litlnj 


2 Protein 
Change 






.17 
.18 


2127 
2154 


TGATTAGAAC [Q/T] GGTAGCCAGT 
AATATTGATA IG/T] AAAAATAAAA 


2 
2 


I l J 
( 1) 


5 
5 


( 4) 
4) 


3* UT 
3' UT 




1437.16 
.19 


2825 
3129 


AGTTTAAGAT [G/C] ACTTGACCCC 
CATGCGTAGC [C/T] TCTTGTCTTA 


5 
7 


( 4) 
( 5) 


3 
3 


2) 
2) 


3' UT 
3' UT 




1440.5 
.6 
.9 
.14 


940 
1327 
1906 
2282 


AACTTCAGAA [G/A] GCCAGTGTTG 
TGGCCCTGCC [T/CJ GGGAAGCCGC 
GACCTGAAGG [C/T] GAACGTGATA 
TCTTAGAGGC [C/T] TTTCTTGTAT 


2 
2 
2 
2 


( 1) 
t 1) 
( 1) 
( 2) 


3 
2 

2 ( 

3 ( 


3) 
2) 
2) 
3) 


Silent 
Silent 
Ala -> 
3' UT 


Val 


1443.4 


1943 


CTTCGTGCGA [G/A] AACCTGAGAA 


3 


( 2) 


2 ( 


1) 


Glu -> 


Lys 


1444.31 


1905 


CCAACAGCCT [C/T] CAAAGATGGG 


3 


( 2) 


28 (20) 


3' UT 




1445.4 
.25 


425 
1281 


CCAGGCTTGC [C/A] AGCCGAAACG 
AACAAAGAAA [A/T] AAAAAAAAAA 


8 

5 


( 5) 
( 4) 


2 ( 
4 ( 


2) 
4) 


Pro -> 
3 ' UT 


Gin 


1446.3 
.17 


1227 
3090 


AGGTGTGGAA [C/T] ACCCTCAGCG 
TTATTTATAT [T/C] TTTAACATAA 


2 
10 


( 1) 
< 7) 


2 ( 
2 ( 


2) 
2) 


Silent 
3» UT 




1447.8 


2681 


GGCAATAGCA [A/G] TCTTGGCTGA 


3 


( 3) 


3 ( 


2) 


3 ' UT 




1448.2 
.3 


521 
587 


AGAAGACCAC [A/G] ATGCGAGATG 
GTCATGCTCT [T/C] GCACTTTACA 


3 
4 


< 2) 
( 3) 


3 ( 
3 ( 


1) 
1) 


Silent 
Silent 




1449.20 
.28 
.31 
.32 
.33 


1261 
1447 
1652 
1653 
1654 


TGCGTAATGC [G/A] GCCGAAGAGC 
CTGAGAGCCC [C/G] AGGCGTCCGC 
TTGCAGATTG [A/C] ATAAAAAAAA 
TGCAGATTGA [A/T] TAAAAAAAAA 
GCAGATTGAA [T/A] AAAAAAAAAA 


4 

21 
8 

11 
6 


( 3) 
(14) 
( 6) 
( 7) 
( 6) 


21 (13) 

2 ( 1) 
6 ( 4) 

3 ( 3) 

4 ( 4) 


Silent 
3' UT 
3' UT 
3' UT 
3- UT 




1450.2 


156 


CCCCATGGCG [G/A] CCGCCAAGGA 


11 


( 9) 


2 ( 


2) 


Ala -> 


Thr 


1451.13 
.14 
.IB 


200 
201 
417 


GATGAGCGTG [A/T] TTCCTCTCGA 
ATGAGCGTGA [T/A] TCCTCTCGAT 
AAGTTCACAT [C/G] AACCTCATGG 


3 
3 
2 


( 2) 
< 2) 
( 1) 


31 (20) 
31 (20) 
28 (18) 


Asp -> 
Asp -> 
3' UT 


Val 
Glu 


1452 .12 
.18 
.19 
.23 


1659 
2410 
2419 
2717 


GTACCAGAGG [C/T] ATGCCTATCA 
ATTTAAGGAC [G/A] AGACCAGCAG 
CGAGACCAGC [A/G] GCTAATCCAA 
GTTAATGATG tT/A] TAATGATTTT 


4 

3 
9 
17 


( 4) 
( 3) 
( 8) 
(13 ) 


2 ( 
9 ( 

3 ( 
5 ( 


1) 
5) 
1) 
3) 


Ala -> 
Silent 
Silent 
3' UT 


Val 


1454.3 
.7 
.8 


338 
1211 
1391 


AGGGCTTTGC [C/T] TTCGTTCAGT 
CATGCTCACT [G/T] TTCTCCCCAT 
GTTTTTAAAAAAA [A/T] AAAAAAA 


3 
9 
3 


( 2) 
( 6) 
( 2) 


6 ( 

2 ( 

3 ( 


2) 
1) 
3) 


Silent 
3 1 UT 
3 ' UT 




1455.6 
.22 
.23 


294 
911 
912 


CCAGGCCTTT [G/T] TCATCTTCAA 
CAGCTCGCGA [T/A] GCCCTGCAGG 
AGCTCGCGAT [G/T] CCCTGCAGGG 


9 
13 
8 


( 8) 
(12) 
( 8) 


2 ( 

3 ( 

4 ( 


2) 
3) 
4) 


Val -> 
Asp -> 
Ala -> 


Phe 
Glu 
Ser 


1460.1 
.30 


6 

547 


AATTC [C/G] CAGAGCAACATGCCC 
GTTCTGCTTC [A/C] CCAGGAGATC 


5 
25 


( 5) 
(17) 


3 ( 
5 ( 


3) 
3) 


5' UT 
3' UT 




1461.5 
.32 


154 
1463 


TCCCCGGGGG [G/C] CTTTGGATCG 
GTGTTACTGC [A/G] TTTTGTACAA 


8 
14 


( 7) 
( 8) 


2 ( 
11 ( 


2) 
8) 


Silent 
3 » UT 




1463.3 


761 


CAGCGTGGGG [G/T] TGGCCACTCC 


2 


{ 1) 


2 ( 


2) 


3- UT 




1464.3 


21 


GCCTGCAGGC [C/T] TCCCGAGGAG 


6 


( 3) 


2 ( 


2) 


Silent 
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Target Loc'n Sequence around # Varia 1 # Varia 2 Protein 

ID [polyaorphism] (Lib) (Lib) Change 





4 


130 


GCAGACTTAT [A/Q] AGGTTGACCT 


3 


( 1 ) 






Lys - > 


Ser 




5 


132 


AGACTTATAA [G/A] GTTGACCTTA 


3 


( 1) 


10 


( 7) 


Silent 




1465 


4 


897 


AGTTCCACCC [T/C] ACAGGCATAT 


2 


( 2 ) 


3 


I 7 \ 
\ 3 i 


Si lent 






5 


1044 


TGTCTCGGTC { C/G] ATGACTCTGG 


4 


< 4) 


2 


( 2) 


Silent 






12 


1758 


GAGCAGAGGC (A/G) CGGAAGGAGT 


8 


( 8) 


3 


( 3) 


Silent 






30 


1892 


ACCCTGTCCT [A/T] TGTGGACGTT 


19 


(17) 


6 


( 6) 


Tyr -> 


Phe 




34 


1938 


ATAGACCCGT [G/A] ATCGACAAAA 


16 


(15) 


9 


( 9) 


Silent 






37 


1975 


CTGTGCCACC I G/A] TCCCGCCAGC 


21 


(20) 


6 


( 6) 


Val -> 


He 




38 


1980 


CCACCGTCCC f G/Cl CCAGCCATTC 


2 1 




5 


( 5 ) 


Silent 




41 


2014 


AGACAAGATG [T/C) GGTGATGACA 


22 


(20) 


5 


( 5) 


3' UT 






42 


2102 


******* * W l A f W J VJVJVMlVUJ/Ulu 


* J 




8 


( 7 ) 


3 1 UT 






45 


2139 


GATTGGCACC [T/C] AGTGGCTGGG 


24 


(20) 


7 


( 6) 


3' UT 




1467 


9 


2297 


CATGGAGGCA [G/A] CCAGGCCCGT 


4 


( 4) 


2 


( 2) 


Ser - > 


Asn 




11 


2353 


TAATAATATG TT/Cl ITr.fTTiWY: 


-i 
j 


\ 3 ) 


2 


( 2 ) 


Tyr -> 


His 


1471 


4 


3042 


CACCCAACCT [G/A] TCCTTACTCA 


2 


( 2) 


3 


( 1) 


3' UT 




1473 


9 


390 




13 


(11) 


5 


( 3) 


Silent 






10 


399 


CCATTCTCAA [G/A] GCCCAAGTGG 


11 


( B) 


3 


( 3) 


Silent 




1474 


1 


8 


TCT [G/A] AACGGAGAGCGTAGTGA 


13 


(10) 


4 


( 3) 


5' UT 






2 


9 


CT (A/T) ACGGAGAGCGTAGTGACC 


14 


(11) 


3 


( 3) 


5' UT 






9 


94 


GCGAGAGGAG [G/T] AGGAATTTAA 


27 


(14) 


2 


( 1) 


Ser - > 


* ♦ * 




24 


370 




21 


(15) 


3 


( 2) 


Leu - > 


Phe 




.26 


392 


AAGTAGGGGC [ C/A] GCCTGTCTGT 


28 


(14) 


2 


( 1) 


3' UT 




1476 


6 


230 


CACAAGTGCC [C/T] TTCGAGCAGA 


12 


( 9) 


2 


( 2) 


Silent 




1477 


.20 


1470 


ATTTGATGGA [G/C] GCTGCGCCGG 


31 


(12) 


6 


( 4) 


Ser -> 


Asp 




.24 


1480 


GGCTGCGCCG fa/m ArtTr:fi.ar:ar:n 


1 A 
J 4 


(14 J 


2 


( 2 ) 


Ser - > 


Gin 




.28 


1647 


TTCCTGTTGA [A/T] AAAAAAAAAA 


9 


( 6) 


3 


( 2) 


3 ' UT 




1478 


.19 


838 


TATGGAAGTA [G/A] CTCCGCAGAG 


17 


(11) 


2 


( 2) 


Ala -> 


Thr 




.29 


1009 


TCCTCAGCTC [C/T] CTGCCTGTTT 


26 


(18) 


2 


( 1) 


3' UT 






.30 


1095 


AATAAACTCTTAAAGA [G/A] CCTT 


2 


( 2) 


24 


(16) 


3' UT 




1480 


.17 


913 


AAGAGGCACT [G/T] TAGCAGCTGC 


17 


(13) 


2 


( 2) 


Val -> 


Leu 




. 18 


939 


TTGCTGCGAC [T/C] GCCAGTATTG 


IB 


UJ / 


z. 




Silent 






.19 


979 


CCCACCAGGA tC/A) GGGGCACTCC 


17 


(12) 


4 


< 4) 


Silent 






.20 


980 


CCACCAGGAC [G/C] GGGCACTCCG 


11 


(10) 


4 


( 4) 


Arg -> 


Pro 




.29 


1113 


TAGGCATGCC [G/C] CCTCCGGGAA 


20 


(13) 


2 


( 2} 


Silent 




1483 


.12 


1969 


ACTTCTCCAT [C/T] CGGTCCCTAG 


2 


( 1) 


2 


( 2) 


Silent 




1484 


.2 


14 0 


ATTACGATGA [G/A] GAGGAAGAGC 


3 


( 2) 


12 


( 8) 


Ser -> 


Glu 




.7 


288 


CTGTGGCTTG [G/A] AGCATCCTTC 


8 


( 7) 


2 


( 2} 


Ser -> 


Lys 




.11 


674 


AGCACTTTGT [G/C] CTGGACGAGT 


3 


( 3) 


2 


( 2) 


Silent 


1486 


.24 


6427 


GCATTAACTA [A/T] AAAAAAAAAA 


5 


( 5) 


7 


( 5) 


3' UT 




1487 


.15 


2896 


GCGCCAAGCC [ C/A J AGCAGGCTAC 


3 


( 3) 


3 


( 1) 


Pro -> 


Gin 




.20 


3303 


AGCCACGGGC [G/T ] TCCTACTGAG 


e 


( 7) 


3 


( 3) 


Val -> 


Phe 




.22 


3394 


CTGGGGAAGC [T/C] CCTGGAAGCC 


11 


(10) 


2 


( 2) 


Leu - > 


Pro 


1489 


.14 


1419 


ACTCAACTCA [C/A] GGTACAAGAC 


7 


( 5) 


3 


( 3) 


3' UT 




1490 


.6 


443 


AGGCTGCTCG [T/C] GTTGCTATTG 


2 


( 2) 


2 


( 2) 


Val -> 


Ala 




.31 


1710 


CTCGTGATGC [A/0] TCTACAGTTA 


11 


( 7) 


19 


(12) 


3' UT 
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.33 1824 GTGGGGGTAC [C/T] ATCTCAACTG 7(4) 13 ( 9) 



1491.21 1488 G CATATGGGA I G / C ] CCATTGGCTG 11 ( 8) 2(2) 

.31 1826 TGTAAGGTTT (C/T] CATTTAGTTT 28 (16) 3 ( i) 



1495.3 391 CAAAAACCCC[G/A]CCCX3CTCCAA 3(2) 3(2) 



1496.5 3017 AATAATAACC [A/G] AGACTTTTCA 6(4) 2 { 2) 

.15 3 932 CTGCCTGGCC (C/T) TTTTTTCTTC 3(1) 6(5) 



1497.13 1332 GCCCCATGTC [G/Aj CTGGGTGGGC 3(2) 5(5) 

.14 1338 TGTCGCTGGG [T/C] GGGCGGCACG 3(2) 5(5) 

.16 1508 GCCACGGCGG (C/T J CGCCAGCGAG 8(4) 2(2) 

.20 1608 CCCCCCGGGCIC/GJCGGACCAGCC 6(4) 5(3) 

.23 1713 AGCGGCTGCG [G/TJ GTCCGTGACA 6(3) 3(2) 

.39 4 022 GGCTTCCCCT [G/A) CGCCCTGGGA 3(2) 6(5) 

.43 4187 AAACAGCAGT IT/CJ CCTGGGAACC 12 (10) 2 ( 1) 

.44 4254 TTTCAAAAAA [T/AJ TTTTTTTAAA 2(2) 11 ( 9) 



1498,5 167 GGCGTGCTGA IG/C J TGCCCTGGGA 8(4) 3(3) 



1500.16 2206 GAAGGAAACA [G/A] TGCAACAGCA 16 (13) 2 

.18 2310 GTTGTTAAGA [G/T] TGGGGGAGAG 25 (18) 2 

.23 2426 TGCCAAGCTG [G/A] ACGGCACGAG 10 ( 7) 4 



1501.5 3 88 GCGCTGTGCG [G/T] TGTCCCCGTC 2(2) 2(2) 

.16 1238 CCCCGGGAGG [G/A] AGCTGACTGA 8(8) 2(2) 



1505.9 3 934 TTAGTCATTC [T/C] AAAAAACACC 6(4) 



1507.2 130 CCCCGAGGCG [A/T] TCGTGGAGGA 3(3) 3(2) 



1508.19 5111 CATCGCCGAG [G/C] CCTGGGCCCG 12 (10) 3 ( 2) 



1510.6 1066 CAAAGGAG CT [ T / C ] GAAGG ATATT 2(2) 5(5) 

.8 1136 TCTAAAAGAA [A/G] AAGGAACTAG 3(2) 2(1) 



1511.10 222 CTACAATATT t C/G ] AAAAGGAGTC 18 (11) 2 ( 1) Gin -> Glu 



1514.6 103 CGGGGCTGCG [G/A] CCGCCCGAGG 11 ( 5) 4(4) 

.24 624 GGCATCGTCA [ G /A ] AAGGAAGGGA 13 ( 5) 6(5) 

.35 879 GCTGTAAAAT [T/C] ATAAACTTTT 27 (12) 2 { 1) 

.38 913 TCCCCCAGGG [G/CJ CGAGTTCCTC 25 (11) 3 ( 2) 

.39 914 CCCCCAGGGG [C/G] GAGTTCCTCG 20 (11) 3 { 3) 

.43 1069 AGACCCCAGG [G/T] CAGCATCTCG 21 ( 9) 5(4) 



1515.6 175 CATGCTAGCA [T/G] GGCCTAATGA 3(2) 9(8) Trp -> Gly 

.28 855 CTGGAGAGCT (T/G] GGCTTCCGCG 15 (11) 4 ( 4) Silent 

• 30 858 GAGAGCTTGG [ C/G ) TTCCGCGCTT 6(6) 7(5) Ala -> Gly 

.38 1146 ATAATAAAAG [T/A] TTCATTTGCA 2(2) 23 (14) 3' UT 



1517.9 742 AATCATAATG [G/C] TTCTCCCCTT 6(3) 2(2) Val -> Ala 

.16 1424 AAGTTATTGG [C/T] AAACGAGGTT 11 ( 7) 3(3) Ala -> Val 



1518.8 947 AGAGCTGAGC [G/A] AGTTCACCAC 5(4) 2(2) 



1519.15 1209 CCATCAAAAG [C/T] TTTGAGAATT 2(2) 6(5) Silent 



1520.12 6696 CAGCCTCATC [G/A] ATCCCAAAAC 5(2) 3(1) Asp -> Asn 

.13 6806 TGCGCGGGAG [ C/A] AAACTGCTCT 2(1) 3 { 1) Ser -> Arg 
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1521 


6 


851 


AGACTCTGAG [G/C] CCTGGTGTGA 


7 


( 6) 


2 ( 


2) 


Arg - > 


Ser 




10 


976 


TTGGGAATGG [A/G] TATCAGAAGA 


15 


( 8) 


4 i 


1) 


3' UT 






15 


1165 


TCACCTATAC [A/G J TTATTTAAAT 


20 


( B ) 


4 


1 ) 


3 U i 






17 


1236 


GAAAACTGTG [C/A] AATTGTGTGC 


7 


< 4} 


3 


1) 


3' UT 




1523 


7 


417 


CACCACGGTG [C/T] TGGAATTGTT 


9 


( 8) 


3 


3) 


Silent 




1524 


.13 


2996 


AAAATGACAT [T/G] AGTTTGAAAA 


3 


( 2) 


3 


2) 


3* UT 






.22 


33B4 


AACAGCTTTT [A/T) GGCCAAGCTG 


20 


( 9) 


4 


4) 


3' UT 






.23 


3385 


ACAGCTTTTA [0/A] GCCAAGCTGG 


16 


\ f ) 


c 
o 


b ) 


3 ' UT 






.25 


3397 


CCAAGCTGGC [C/T] TGACGGTATG 


25 


(ID 


4 


3) 


3' UT 




.26 


3398 


CAAGCTGGCC [T/Ql GACGGTATGG 


25 


1X1/ 


i 
j 


2 ) 


3 ' UT 




1526 


.6 


2476 


TGGAGGTGCA IT /CI AACCTACTTA 


■5 
*• 


I T \ 

\ l ; 


2 


1 ) 


Silent 






.7 


2715 


GTGAAAGGGG [A/CJ CGTGTACTCT 


2 


( 2) 


3 


1) 


Asp - > 


Ala 


1528 


.6 


770 


CCAAAAGGAA [G/A] TGAATCAGCA 


2 


( 2) 


2 


2) 


Val -> 


Met 




.10 


2396 


GCAGTGCGCA TA/Tl TCCTGGACC"*t* 


n 
J. 


/ i \ 
\ l J 


4 


4 ) 


Val - > 


Phe 




.26 


3317 


TTCAAGTGAA [ 0/C ] ATGCTGAAAG 


12 


( B) 


7 


6) 


Asp - > 


His 




.32 


3598 


TATAATTAGT [T/C] ATGACAGCCA 


19 


(16) 


2 


1) 


3' UT 




1530 


.8 


427 


ATCCGCCCCC [A/G] CGACGTCCCC 


4 


( 3) 


2 


1) 


Thr -> 


Ala 




.13 


894 




8 


( 5 ) 


2 


1 ) 


Ser - > 


Glu 




.30 


1579 


AGTCCTGAAA [G/A] GCCCAAGGCC 


4 


( 3) 


7 


6) 


3' UT 




1532 


.6 


496 


TCGTGCGCAA [C/T) GTGCCCTGGG 


4 


( 2) 


6 


( 3) 


Silent 






.10 


963 


CTGGCCTTAT [G/T] CCCAGGCCTG 


6 


( 4) 


2 


2) 


Cys - > 


Phe 


1533 


. 12 


2092 




3 


( 3 ) 


2 


'. 2 ) 


Asp - > 


Ala 


1534 


.4 


264 




■5 




r 

b 


I 4 ) 


Silent 




1536 


.22 


6641 


TTAGATATAT f A/Ql TATTC'ATT fT 


3 




4 




3 ' UT 






.24 


6779 


ATTTTTATTG [G/A] GCCCAAAAAC 


2 


( 2) 


11 


( 8) 


3' UT 






.26 


7097 


AGTGGAATGT £T/A] TAAAAAAAAA 


4 


( 3) 


4 


( 3) 


3' UT 




1537.5 


871 


AGGGCAGTGC [C/A] ATTGATAGGA 


7 


< 6) 


3 


( 3) 


Silent 






.10 


1466 


GCAGGCATGC [C/A] AGTCTCTGCC 


7 


( 7) 


3 


( 3) 


3' UT 




1538 


.21 


938 


CCTCCACCTT [T/C] GACGCTGGGG 


14 


( 7 ) 


3 




S i 1 ent 




1539 


.1 


67 


TCGCGGCCTA [G/C] CTTTACCCGC 


3 


( 3 ) 


2 




2 Ui 






.3 


304 


TCGATGGCTC [T/C] AGTACTTTAC 


4 


( 4) 


4 


( 3) 


Silent 






. 9 


1075 


GTAGCGCCAG [A/C] CTACGCATTC 


2 


( 2) 


3 


( 2) 


Arg - > 


Ser 




.16 


2048 


CAAGGAAGTG [G/A] TTCTTAGATG 


8 


( 7) 


4 


( 2) 


3 ' UT 






.21 


2718 


GCCTAACATAA [ A/ G ] AAAAAAAAA 


8 


( 8) 


3 


( 3) 


3 1 UT 




1541 


.1 


4123 


TGGCGAGGGG [G/C] CTTGACGGCG 


2 


( 1) 


2 


( 2) 


3 1 UT 




1543 


.4 


319 


GCACCGGAAG [G/A] AGGCGCTGAC 


6 


( 5) 


2 


( 2) 


Ser - > 


Lys 


1544 


.3 


534 


TTGAGCCCAA [ C/G] TGCTTGGACG 


2 


( 2) 


7 


( 4) 


Asn - > 


Lys 




.4 


543 


ACTGCTTGGA [C/T] GCCTTCCCAA 


4 


( 4) 


7 


( 4) 


Silent 




.8 


643 


ACCTGTGTTC [T/A] CAAAGATGGC 


12 


( B) 


3 


( 3) 


Ser -> 


Thr 




.12 


728 


GCTGCCCAGG [C/G] TGTGCAGCGC 


12 


(11) 


4 


( 1) 


3 ' UT 






.21 


902 


AACATCCCCT [C/T] CCATCATTAC 


5 


( 4) 


4 


( 2) 


3 ' UT 






.22 


986 


CTGCCTGGCC [C/T] CTCGCCTGTG 


5 


( 4) 


2 


( 2) 


3 ' UT 




1545 


.4 


1470 


CGGTGAGACC [G/A] TTGCCCGCTG 


2 


( 1) 


2 


( 2) 


Val -> 


He 


1546 


.1 


172 


CTCTGAAGAC [A/T] TGGAGATACT 


3 


( 1) 


3 


( 3) 


Met - > 


Leu 
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1547.17 976 TGCTTTAAAG (G/A] GCCTGCCTGG 13 (10) 2 ( 2) 



1548.3 1209 CATTATTGGC [C/T] TCATCAAACC 3(3) 3(1) 

.4 1286 TGAAAGGTGT [A/G] AATAAGTTAC 2(2) 3(2) 

.8 1904 ATAACTAAGA [C/T] TTCTGTGCAT 6(3) 5(3) 



1550.7 797 TGGACGCCTT [T/C] CCAAATCTGA 2(2) 5(2) 



1551.12 2215 CGAGACCATC[T/C)TGGCCCCTCC 3(1) 10 ( 9) 

.14 2242 TGCCTGAGCC [T/C) AGGAGCTTGA 3(1) 9(8) 

.15 2341 ACTGGGTCTC (G/Aj CTCCGAGTGG 3(1) 9(8) 

.16 2372 GGAGGGAGGG [T/A] CAGGGGGAGG 3(1) 9(8) 



1554.12 834 GGGACTTTAT [C/GJ GATTGCTTCC 6(5) 2(1) lie -> Met 

-14 999 ACCCAGAGGT [ C/G] ACAGCTAAAG 8(6) 2(1) Silent 

.23 1539 ATCTGGCTGC [T/CJ GATCTGCTAT 5(4) 5(4) 3 • UT 



1555.5 424 TATGGATGCC [A/G] AGCACCACAA 17 ( 8) 3(1) 

.9 515 GCCAGCACCA [G/C] CCAGGAGCTG 17 ( 7) 3(3) 

.30 1088 TCCTCGGCTG [C/A] GTTCAGTCCT 2(2) 8(5) 



1556.7 2037 TGATCTTTGC [C/T] CCTGGTATGC 5(5) 5(3) 3 ' UT 



1560.7 2335 GCATTCAAGA [C/T] GGATACAGAG 5(5) 2(1) Thr -> Met 



1561.1 90 CTGTGCTGCCtC/TjGGCTCCCCCA 2(2) 2(2) Silent 

.5 373 CCCTGACATC [A/G] TGGAGTTCTG 2(1) 2(2) Met -> Val 

.22 1250 TGTTTCCTTT [T/G] GGGCTCAAAG 8(7) 4(4) 3 ' UT 

.23 1251 GTTTCCTTTT [G/T] GGCTCAAAGC 7(6) 4(4) 3 * UT 



1562.14 540 ATTGTCGCAC [C/T] TCCTACACCT 21 ( 9) 2(1) Silent 

.30 799 AGCCATGAGT [G/T] GGGCTGGGCC 14 ( 7) 3(3) Gly -> Trp 



1563.10 3 076 ACTCCCCTTC [A/G] TGAAACCAGA 2(1) 2(2) Met -> Val 



1564.7 339 CTTTGGAAAG [T/C] GTGAAAGCTG 15 (10) 2 ( 1) Silent 



156 6.2 53 GCAGGCACAG [T/C] GTCACCTTCG 2(1) 2(2) 5 * UT 

.4 175 TCCTGGCGGC [G/A] CCTCGTGTGC 3(1) 4(4) Arg -> His 

.10 791 GCATGAATCC [C/T] GGCCCAGGCG 3(1) 4(4) Silent 

.23 1741 TGCACTCTGT [G/C] CTCCGCCCAA 3(2) 3(2) Cys -> Ser 

.24 1742 GCACTCTGTG [C/G] TCCGCCCAAG 3(2) 3(2) Cys -> Trp 



1567.2 1083 GGAATACTGG (G/A) AGAATCTTCG 5(3) 2(1) Ser -> Lys 



1571.4 14 80 AGAGAAAATT [G/A] GGGAAAAGGT 4(4) 3(2) 3 * UT 

.14 2087 TCTGTCTGGT [G/A] TGGTATGAAT 5(5) 4(2) 3 * UT 



1576.13 1777 CGCCCCTCCC [C/T] CCTCTGGCCC 3(2) 2(2) 3 • UT 

.16 2031 AATTGTACATTC [C/T] CTGCATCC 3(2) 2(2) 3 ' UT 



1577.10 3022 TGCCGGCCGG [A/G] ACCCAGCGGC 2(2) 6(5) Asn -> Asp 

.15 3229 CACACCACCG [T/C] CCTCCTCGCT 2(2) 5(4) 3 ' UT 

.33 3859 GGTAGCCACC [G/A] CCGGGGCACT 18 (13) 4(3) 3 ' UT 

.38 3980 CTGATGCATC [G/A] TTTTCTTTGC 18 (14) 4(3) 3 1 UT 

.4 7 404 9 GCCAGGCCAT [G/T] GCCAAGGGGC 7(6) 3(3) 3 ' UT 

.50 4055 CCATGGCCAA [G/A] GGGCCAGCTG 5(5) 5(5) 3 ' UT 



1578.5 178 TACTTCGACC [G/A] CAAAAGACGA 7(7) 2(2) Arg -> His 

.12 4 51 CTTCCACCAC [C/T] AGTGTTCCAG 8(6) 3(2) Pro -> Leu 
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13 


468 


CCAGATGCTT [C/T] TGACTAAGCT 


8 


< 6) 


3 


( 2) 


Silent 




15 


501 


TCAGAGAATT [G/C] TAAGTGCTCA 


5 


( 5} 


2 


( 2) 


Val -> 




17 


551 


AAACAAATGT [C/T] AACA7AATAA 


5 


< 5) 


4 


( 3) 


3' UT 




19 


630 


GGGCAAATAT [G/C] CTTG7TATGA 


7 


( 6) 


2 


( 2) 


3 ' UT 




20 


683 


CTTTGTGTAG [A/G] TCCA7TTGTC 


9 


( 7) 


2 


( 2 ) 


3 1 UT 




25 


2725 


AGGTGAGAAC [A/G) AAAAAACCCC 


6 


( 5) 


3 


( 3) 


3 ' UT 


1579 


15 


1735 


GCTGCAGCGG [C/T] TGGCAGACGG 


17 


(12) 


2 




Silent 




19 


1881 


GGATC CGAGA [G/A] GGCATGGCCG 


14 


(12) 


5 


( 5) 


Ser -> 




26 


2010 


GAATACTCCC [G/C] GCCAGGGCCT 


12 


(10) 


17 


M ft \ 


3 ' UT 


1581 


2 


1897 


CCGCTAAAAT [G/A] AGAATAAGGT 


3 


( 3) 


5 


( 4) 


Met -> 


.5 


2232 


TGAATGTAAC [T/C] GCTTTAAGAA 


3 


( 3) 


5 


( 5) 


3 1 UT 


1583 


7 


1482 


AAGACACAGA TA/Tl GGAfW^CCCfi. 




v s ; 


3 


( 2 ) 


Glu - > 




11 


1772 


GCTTTTAATA [G/C] TGTCATAAAG 


3 


( 3) 


2 


( 2) 


3' UT 


1584 


18 


576 


CGCCCTCACA [G/A] CCTCCTTCTG 


2 


< 2) 


2 


( 2) 


Silent 




34 


1098 


ATATGGATGG f C/T 1 RnTAPrTTfa 








I 2 ) 


Ala - > 




46 


1708 


GAGAAACCCC [C/T] GGGGACCATG 


3 


( 3) 


2 


( 2) 


3' UT 




50 


1846 


GAGGGATTGA [G/A] CACAGGCACA 


2 


( 2) 


6 


( 6) 


3' UT 




51 


1857 


AGCACAGGCA [ C/A] AGAGGTGCTG 


2 


I 2) 


6 


( 6) 


3' UT 


1587 


11 


1330 


GCCTGCGTGG [G/C] AACTCATGCA 


7 


( 2) 


11 


(10) 


Glu -> 




.12 


1356 


* rtOrtrtV— w i 1 J O/Wv 1 1 LLL/\L 


1 8 




2 


I 2 ) 


Silent 


1588 


.26 


1956 




7 




4 


( 3 ; 


3 ' UT 


1590 


.2 


172 


TGCACGCAGC [C/A] ATGGCTGACA 


6 


( 3) 


2 


( 2) 


Silent 




.7 


547 


CGCTGGATAA [C/T] GCCTACATGG 


B 


( 4) 


2 


( 2) 


Silent 




.9 


850 


TCATCCGCAA [G/A] GCATCTGATG 


4 


( 2) 


2 


< 2) 


Silent 




.33 


2139 


AGCCGACTCT f G/Tl GCCCTGGCdC 


14 


( 9 ) 






3 ' UT 


1594 


.10 


1730 


ACCCCAGTGG [G/A] AACTGTGCAA 


6 


{ 5) 


2 


( 2) 


3' UT 




.13 


1975 


GAAACTAACT [C/T] GGTGGCCCCA 


6 


( 5) 


9 


( 6 ) 


3 ' UT 




.14 


1985 


CGGTGGCCCC [A/G] ACAGGTCTTC 


6 


( 5) 


9 


< 6) 


3 ' UT 


1596 


.3 


1773 


TGATGTGGTA [C/T] GTCCCTCCAC 


10 


( 7) 


3 


( 2) 


3* UT 




.6 


1844 


GTATTCACCA [A/C] GCATTTTAGG 


10 


( 8) 


4 


< 3) 


3' UT 




.11 


1899 


GCATTTACAA [G/A] GCACTGCCAA 


17 


(12) 


3 


( 3) 


3 ' UT 




.12 


1900 


CATTTACAAG [G/T] CACTGCCAAA 


19 


(12) 


2 


( 2) 


3 1 UT 




.16 


1949 


AGAGGACCTG [C/T] GGGCTTAGAT 


24 


(16) 


2 


( 1) 


3 1 UT 


1598 


. 3 


2042 


ATGCCTAAGA (C/A] CAACTGCGTT 


2 


( 2) 


3 


( 1) 


3 ' UT 


1603 


.5 


592 


TCTGTGGCAC [T/C] GATATGACCA 


2 


( 2) 


2 


( 2) 


5' UT 




.16 


2566 


TGAAACTGAG [G/C] CCCATCCTCA 


17 


(12) 


2 


( 2) 


Arg -> 




.18 


2662 


CCGGGGAAGC [T/G] GCCGTCTAAA 


13 


(11) 


3 


( 3) 


Silent 




.28 


2953 


TTAGAATTTT [C/T] CTAAAAATAA 


26 


(18) 


2 


( 1) 


3' UT 


1605 


.14 


2879 


AACACGGCCC [T/C] GCTGTCGCTG 


2 


( 2) 


2 


( 1) 


Leu -> 




.30 


4011 


AATTTAAAGT [A/C] TTCTCCTCCC 


4 


{ 2) 


6 


< 6) 


3' UT 


1607 


.13 


2354 


CTTTCTCTGG [C/T] CCTGTCCATG 


9 


( 8) 


2 


( 2) 


3' UT 


1608 


.3 


2120 


CAGCCGCCAT [T/C] TGCAAGGAAG 


2 


( 2) 


2 


( 2) 


3' UT 




.11 


2552 


CAAAAGATGA [G/T] TCCTTGCTTC 


16 


( 9) 


4 


( 3) 


3' UT 




.17 


2733 


TCCTAAGCAG [T/C] CCTGGCTTTT 


25 


(11) 


3 


( 3) 


3' UT 




.01 


2091 


CTCCTTCCAA [C/T] CCCACTCCCC 


65(36) 


7(7) 


3 'UT 




.02 


2120 


CAGCCGCCAT [T/C] TGCAAGGAAG 


25 (18) 


47 (40) 


3 'UT 




. 04 


2578 


GAAATAAAAG [T/G] AGCCCAGCTG 


26(19) 


46 (29) 


3 1 UT 




.05 


969 


AACCTAGTGC [G/A] ACCAAGGGAA 


69(36) 


3 (3) 


Silent 
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.06 2174 CCTCTCCCAG[C/T]GGCCTCCCCC 71(36} 1(1) Silent 

.07 2129 TTTGCAAGGA [A/G] GGCCTAATCA 66(36) 6(6) Silent 
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Targat Loc'n Sequence around « Varia 1 # Varia 2 Protain 



ID 






(polymorphism) (Lib) 




(Lib) 


Change 




1613 


2 


350 


AGTGGCCATG [G/A] TTGGGTCAGC 


10 


( 7) 


3 


( 3) 


Val -> 




.11 


842 


T GAT CAT CAT [T/C] TCCTTGCGGA 


3 


( 3) 


6 


{ 4) 


3 ■ UT 


1614 


.5 


1343 


CCTATCTGGA [T/C] ACATTTGGCC 


2 


( 2) 


3 


( 3} 


Silent 




.13 


1641 


CGGCGGTGGA [G/A] GCTGAGCGCC 


10 


( 9) 


2 


( 2) 


Ser -> 




.23 


2158 


TTTTTTTTTT [T/A] AAAAAAGAAA 


7 


( 7) 


8 


( 5) 


3' UT 




.28 


2261 


CTGAAGTCTA [G/A] GATATTTTTC 


5 


( 5) 


2 


( 2) 


3' UT 



lie 



1615.25 2113 CCTGGCCATC [T/C] TGGGCAGTGT 16 (11) 7 ( 5) Silent 
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Research Collaboration 



A ACAGCCACTTATGTCATGGT 

B ACAGCCACTTATGTCATGGT 

C A CAGCC ACTTATGT CATGGT 

D CACTTATGTCATGGTATTCA 

E CACTTATGTCATGGTATTCA 

F CACTTATGTCATGGTATTCA 



Improved Allele-Specificity with 
Advanced Chemistry 
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Q 2 base mismatch 
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Effect of Antisense Oligomers on Glutamyl- 
prolyl-tRNA Synthetase (EPRS) mRNA levels 

(duplicates) 



cone 



14977 
(100% matched) 

0 50 100 200 300 400 




14971 
(2 mismatches) 



0 50 100 200 300 400 nM 




'circled samples were switched 
when loaded on to the gel 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1, 13, 21, 29, 37, 38, 53, 54, 69, 77-79, 
101 and 109 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and encoding a product required for cell 
proliferation wherein cells not targeted by the inhibitor 
have at least one alternative variant allele, methods for 
identifying, producing and using such inhibitors and 
pharmaceutical compositions comprising them. 



2. Claims: 2, 14, 22, 30, 39, 40, 55, 56, 70, 80-82, 
102 andllO 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and encoding a product required to 
maintain inorganic ions and vitamins at levels compatible 
with cell growth or survival wherein cells not targeted by 
the inhibitor have at least one alternative variant allele, 
methods for identifying, producing and using such inhibitors 
and pharmaceutical compositions comprising them. 



3. Claims: 3, 15, 23, 31, 41, 42, 57, 58, 71, 83-85, 
103 and 111 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and encoding a product required to 
maintain organic compounds at levels compatible with cell 
growth or survival wherein cells not targeted by the 
inhibitor have at least one alternative variant allele, 
methods for identifying, producing and using such inhibitors 
and pharmaceutical compositions comprising them. 



4. Claims: 4, 16, 24, 32, 43, 44, 59, 60, 72, 86-88, 
104 and 112 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and encoding a product required to 
maintain cellular proteins at levels compatible with cell 
growth or survival wherein cells not targeted by the 
inhibitor have at least one alternative variant allele, 
methods for identifying, producing and using such inhibitors 
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and pharmaceutical compositions comprising them. 
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Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and encoding a product required to 
maintain cellular nucleotides at levels compatible with cell 
growth or survival wherein cells not targeted by the 
inhibitor have at least one alternative variant allele, 
methods for identifying, producing and using such inhibitors 
and pharmaceutical compositions comprising them. 



6. Claims: 6, 18, 26, 34, 47, 48, 63, 64, 74, 92-94, 
106 and 114 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and encoding a product required to 
maintain the integrity and function of celular and 
subcellular structures wherein cells not targeted by the 
inhibitor have at least one alternative variant allele, 
methods for identifying, producing and using such inhibitors 
and pharmaceutical compositions comprising them. 



7. Claims: 7-10, 19, 27, 35, 49, 50, 65, 66, 75, 95-97 and 107 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability and being located on a high frequency 
loss of heterozygosity chromosomal arm region, wherein cells 
not targeted by the inhibitor have at least one alternative 
variant allele, methods for identifying, producing and using 
such inhibitors and pharmaceutical compositions comprising 
them. 



8. Claims: 11, 12, 20, 28, 36, 51, 52, 67, 68, 76, 
98-100 and 108 



Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene vital for cell 
growth or viability having at least two sequence variances 
which occur at frequences such that at least 10% of a 
population is heteroziguous for that gene and wherein cells 
not targeted by the inhibitor have at least one alternative 
variant allele, methods for identifying, producing and using 
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such inhibitors and pharmaceutical compositions comprisinq 
them. 



9. Claims: 115-129 

Inhibitors (in particular nucleic acids) targeting at least 
one but less than all alleles of a gene conditionally vital 
for cell growth or viability wherein cells not targeted by 
the inhibitor have at least one alternative variant allele, 
methods for identifying, producing and using such inhibitors 
and pharmaceutical compositions comprising them. 



10. Claims: 131-146 

Methods using inhibitors targeting at least one but less 
than all alleles of a gene vital for cell growth or 
viability wherein cells not targeted by the inhibitor have 
at least one alternative variant allele related to 
transplantation and engraftment. 



11. Claims: 147-150 

Methods for separating a cell from a mixture using allele 
specific binding compounds targeting at least one but less 
than all alleles of a gene wherein cells not targeted by the 
compound have at least one alternative variant allele. 
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